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The foil length clone shown in Figure 189 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 54-56 and ending at the stop codon found at nucleotide positions 
909-911 (Figure 189; SEQ ID NO:271). The predicted polypeptide precursor (Figure 190, SEQ ID NO:272) 
is 285 amino acids long. PRO1360 has a calculated molecular weight of approximately 31 ,433 daltons and an 
estimated pi of approximately 7.32. Clone DNA59488-1603 was deposited with the ATCC on August 25, 1998 
5 and is assigned ATCC deposit no. 203 157. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the lull-length sequence shown in Figure 190 (SEQ ID NO:272), revealed sequence identity 
between the PRO1360 amino acid sequence and the following Dayhoff sequences: UN51_CAEEL, 
YD4B_SCHPO, AF000634_1, GFOZYMMO, YE1J_SCHP0, D86566J, ZMGFOJ, S76976, 
10 PPSA_SYNY3, and CEF28B14. 

EXAMPLE 84: Isolation of c DNA clones Encoding Human PRO 1029 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 18763. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al.. Methods in Enzymolory 266:46<M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
20 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57854. 

In light of an observed sequence homology between the DNA57854 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T98880. the Merck EST clone T98880 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 191 and is herein designated as DNA59493-1420. 

Clone DNA59493-1420 contains a single open reading frame with an apparent translations initiation 
site at nucleotide positions 39-41 and ending at the stop codon at nucleotide positions 297-299 (Figure 191). The 
predicted polypeptide precursor is 86 amino acids long (Figure 192). The full-length PRO1029 protein shown 
in Figure 192 has an estimated molecular weight of about 9.548 dahons and a P I of about 8.52. Analysis of the 
30 full-length PRO1029 sequence shown in Figure 192 (SEQ ID NO:274) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 19, an amino acid block having homology to 
bacterial rhodopsins retinal binding site protein from about amino acid 50 to about amino acid 61, a prenyl group 
binding site from about amino acid 83 to about amino acid 86 and a potential N-g]ycosylation site from about 
amino acid 45 to about amino acid 48. Clone DNA59493-1420 has been deposited with ATCC on July 1 , 1998 
35 and is assigned ATCC deposit no. 203050, 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 192 (SEQ ID NO:274), evidenced significant 
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homology between the PRO1029 amino acid sequence and the following Dayhoff sequences: S66088. 
AF031815J, MM4A6L_1, PSEIS52a-l, S 17699 and P_R63635. 

EXAMPLE 85: Isolation of cDNA clones Encoding Human PROl 139 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
5 cluster sequence from the lncyte database, designated 4461 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ* lncyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzymology 266:460^80 (1 996)). Those comparisons resulting in a BLAST score of 70 (or 
10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle. Washington). The 
consensus sequence obtained therefrom is herein designated DNA57312. 

The DNA57312 consensus sequence included a 172 nucleotides long public EST (T62095, 
Merck/University of Washington public database). This EST clone, identified herein as a putative protein coding 
15 sequence, was purchased from Merck, and sequenced to provide the coding sequence of PROl 139 (Figure 193). 
As noted before, the deduced amino acid sequence of DNA59497- 1496 shows a significant sequence identity with 
the deduced amino acid sequence of HSOBRGRPJ. The full-length protein (Figure 194) contains a putative 
signal peptide between amino acid residues 1 and about 28, and three putative transmembrane domains 
(approximate amino acid residues 33-52, 71-89, 98-120). 

20 

EXAMPLE 86: Isolation of cDNA clones Encoding Human PRO 1 309 

An expressed sequence tag (EST) DNA database (LIFESEQ*, lncyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed homology to SLIT. 

RNA for construction of cDNA libraries was isolated from human fetall brain tissue. The cDNA 
25 libraries used to isolate the cDNA clones encoding human PROl 309 were constnicted by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the SHI site; see. Holmes et al. , Science . 
30 253: 1278-1250 (1991)) in the unique Xhol and NotI. 

The cDNA libraries (prepared as described above), were screened by hybridization with a synthetic 
oligonucleotide probe derived from the above described lncyte EST sequence: 

5--TCCGTGCAGGGGGACGCCTTTCAGAAACTGCGCCGAGTTAAGGAAC-3- (SEQ ID NO:279). 

A cDNA clone was isolated and sequenced in entirety. The entire nucleotide sequence of DNA59588- 
1571 is shown in Figure 195 (SEQ ID NO:277). Clone DNA59588-1571 contains a single open reading frame 
with an apparent translational initiation site at nucleotide positions 720-722 and a stop codon at nucleotide 
positions 22*6-2288 (Figure 195; SEQ ID NO:277). The predicted polypeptide precursor is 522 amino acids 



446 



WO 99/63088 



PCT7US99/12252 



long. The signal peptide is approximately at 1-34 and the transmembrane domain is at approximately 428-450 
of SEQ ID NO:278. Clone DNA59588-1571 has been deposited with ATCC and is assigned ATCC deposit no. 
203106. The full-length PRO1309 protein shown in Figure 196 has an estimated molecular weight of about 
58,614 daltons and a pi of about 7.42. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
5 alignment analysis of the full-length sequence shown in Figure 196 (SEQ ID NO:278), revealed sequence identity 
between the PRO1309 amino acid sequence and the following Dayhoff sequences: AB007876J , GPV MOUSE, 
ALS_RAT, P_R85889, LUM_CHICK, AB014462J, PGS1_CANFA, CEM88_7, A58532 and GEN11209. 

EXAMPLE 87: Isolation of cDNA Clones Encoding Human PRO1028 
10 Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
15 Enzymoloqy 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA59603. 

In light of an observed sequence homology between the DNA59603 sequence and an EST sequence 
contained within Incyte EST clone no. 1497725, the Incyte EST clone no. 1497725 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 197 and is herein designated as DNA59603-1419. 

The entire nucleotide sequence of DNA59603-1419 is shown in Figure 197 (SEQ ID NO:280). Clone 
DNA59603-1419 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 21-23 and ending at the stop codon at nucleotide positions 612-614 (Figure 197). The predicted 
polypeptide precursor is 197 amino acids long (Figure 198). The full-length PRO1028 protein shown in Figure 
198 has an estimated molecular weight of about 20,832 daltons and a pi of about 8.74. Clone DNA59603-1419 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:281, the putative signal peptide is at about amino 
acids 1-19 of SEQ ID NO:281. An N-glycosylation site is at about amino acids 35-38 of SEQ ID NO:281. A 
C-type lectin domain is at about amino acids 108-1 17 of SEQ ID NO:281 , indicating that PR0513 may be related 
to or be a lectin. The corresponding nucleotides of these amino acid sequences or others can be routinely 
determined given the sequences provided herein. 
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EXAMPLE 88: Isolation of c DNA Clones Encoding Human PRO1027 

Use f the signal sequence algorithm described in Example 3 above allowed identification of a certain 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56399. 

In light of an observed sequence homology between the DNA56399 sequence and an EST sequence 
contained within Incyte EST clone no. 937605, the Incyte EST clone no. 937605 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 199 and is herein designated as DNA59605-1418. 

The entire nucleotide sequence of DNA59605-1418 is shown in Figure 199 (SEQ ID NO:282). Clone 
DNA59605-1418 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 31-33 and ending at the stop codon at nucleotide positions 262-264 (Figure 199). The predicted 
polypeptide precursor is 77 amino acids long (Figure 200). The full-length PRO1027 protein shown in Figure 
200 has an estimated molecular weight of about 8,772 daltons and a pi of about 9.62. Clone DNA59605-1418 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:283, the putative signal peptide is at about amino 
acids 1-33 of SEQ ID NO:283. The type II fibronectin collagen-binding domain begins at about amino acid 30 
of SEQ ID NO:283. The corresponding nucleotides for these amino acid sequences and others can be routinely 
determined given the sequences provided herein. PRO1027 may be involved in tissue formation or repair. 

The following Dayhoff designations appear to have some sequence identity with PRO1027: 
SFT2_YEAST; ATM3E9_2; A69826; YM16_MARPO; E64896; U60193J; MTLRAJ205J; MCU60315_70; 
SPAS_SHIFL; and S54213. 



EXAMPLE 89 : Isolation of cDNA Clones Fnr- odine Human PRO! 107 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enaanology 266:460-180 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56402. 

In light of an observed sequence homology between the DNA56402 sequence and an EST sequence 
contained within Incyte EST clone no. 3203694, the Incyte EST clone no. 3203694 was purchased and the cDNA 
insert was btained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 201 and is herein designated as DNA59606-1471 
5 The entire nucleotide sequence of DNA59606-1471 is shown in Figure 201 (SEQ ID NO:284). Clone 

DNA59606-1471 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 244-246 and ending at the stop codon at nucleotide positions 1675-1677 of SEQ ID NO:284 (Figure 
201). The predicted polypeptide precursor is 477 amino acids long (Figure 202). The full-length PROl 107 
protein shown in Figure 202 has an estimated molecular weight of about 54,668 daltons and a pi of about 6.33. 
10 Clone DNA59606-1471 has been deposited with ATCC on June 9, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 107 polypeptide suggests that it possesses 
significant sequence similarity to phosphodiesterase I/nucleotide phyrophosphatase, human insulin receptor 
15 tyrosine kinase inhibitor, alkaline phosphodiesterase and autotaxin, thereby indicating that PROl 107 may have 
at least one or all of the activities of these proteins, and that PROl 107 is a novel phosphodiesterase. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity 
between the PROl 107 amino acid sequence and at least the following Dayhoff sequences: AF005632J, 
P_R79148, RNU78787 1, AF060218_4, A57080 and HUMATXT l. 

20 

EXAMPLE 90: Isolation of cDNA clones Encoding Human PROl 140 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence, Incyte cluster sequence No. 135917. This sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
25 EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
EnzymoloRv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 
30 therefrom is herein designated DNA56416. 

In light of an observed sequence homology between DNA564 16 and an EST sequence contained within 
Incyte EST clone no. 3345705, Incyte EST clone no. 3345705 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59607-1497, which 
is shown in Figure 203, is the full-length DNA sequence for PROl 140. Clone DNA59607-1497 was deposited 
with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209946. 

The entire nucleotide sequence of DNA59607-1497 is shown in Figure 203 (SEQ ID NO:286). Clone 
DNA59607-1497 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 210-212 and ending at the stop codon at nucleotide positions 975-977 (Figure 203). The predicted 
polypeptide precursor is 255 amino acids long (Figure 204). The full-length PROl 140 protein shown in Figure 
204 has an estimated molecular weight of about 29,405 daltons and a pi of about 7.64. Analysis of the full- 
length PROl 140 sequence shown in Figure 204 (SEQ ID NO:287) evidences the presence of three 
transmembrane domains at about amino acids 101 to 118, 141 to 161 and 172 to 191. 

Analysis of the amino acid sequence of the full-length PROl 140 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PROl 140 amino acid sequence and the following 
Dayhoff sequences: AF023602 1, AF000368_1, CIN3_RAT, AF003373J, GEN13279, and AF003372J. 

Clone DNA59607-1497 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209946. 



EXAMPLE 91 : Isolation of cDNA clones Encoding Human PROl 106 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence. This sequence was then compared to a variety of expressed sequence tag (EST) databases 
which included public EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 

15 Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 (Altshul etal., Methods in Enzvmolopv 266:460-480 (1996)1 Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
Univ. of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated 

20 DNA56423. 

In light of an observed sequence homology between DNA56423 and an EST sequence contained within 
Incyte EST clone no. 1711247, Incyte EST clone no. 171 1247 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59609- 1470, which 
is shown in Figure 205, is the full-length DNA sequence for PROl 106. Clone DNA59609-1470 was deposited 

25 with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209963. 

The entire nucleotide sequence of DNA59609-1470 is shown in Figure 205 (SEQ ID NO:288). Clone 
DNA59609-1470 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 61-63 and ending at the stop codon at nucleotide positions 1468-1470 of SEQ ID NO:288 (Figure 205). 
The predicted polypeptide precursor is 469 amino acids long (Figure 206). The full-length PROl 106 protein 

30 shown in Figure 206 has an estimated molecular weight of about 52,689 daltons and a pi of about 8.68. It is 
understood that the skilled artisan can construct the polypeptide or nucleic acid encoding therefor to exclude any 
one or more of all of these domains. For example, the transmembrane domain region(s) and/or either of the 
amino terminal or carboxyl end can be excluded. Clone DNA59609-1470 has been deposited with ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that the 

35 sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 106 polypeptide suggests that it possesses 
significant sequence similarity to the peroxisomal ca-dependent solute carrier, thereby indicating that PROl 106 
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may be a n vel transporter. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 
35) evidenced sequence identity between the PROl 106 amino acid sequence and at least the following Dayhoff 
sequences, AF004161_1, IG002N01_25, GDC_BOVIN and BT1_MAIZE. 

EXAMPLE 92: Isolation of cDNA clones Encoding Human PRQ1291 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 120480. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
10 et Methods in Enzymolog Y 266:46(1480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56425. 

In light of an observed sequence homology between the DNA56425 sequence and an EST sequence 
15 encompassed within the Incyte EST clone no. 2798803, the Incyte EST clone 2798803 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 207 and is herein designated as DNA59610-1556. 

Clone DNA59610-1556 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 61-63 and ending at the stop codon at nucleotide positions 907-909 (Figure 207). The 
20 predicted polypeptide precursor is 282 amino acids long (Figure 208). The full-length PR01291 protein shown 
in Figure 208 has an estimated molecular weight of about 30,878 daltons and a pi of about 5.27. Analysis of 
the full-length PR01291 sequence shown in Figure 208 (SEQ ID NO:291) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 28, a transmembrane domain from about 
amino acid 258 to about amino acid 281 and potential N-glycosylation sites from about amino acid 1 12 to about 
25 amino acid 1 15, from about amino acid 160 to about amino acid 163, from about amino acid 190 to about amino 
acid 193, from about amino acid 196 to about amino acid 199, from about amino acid 205 to about amino acid 
208, from about amino acid 216 to about amino acid 219 and from about amino acid 220 to about amino acid 
223. . Clone DNA59610-1556 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209990. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 208 (SEQ ID NO:291), evidenced significant 
homology between the PR01291 amino acid sequence and the following Dayhoff sequences: HSU90552J, 
HSU90144J, AF033107J, HSB73J, HSU90142J, GGCD80J, P_W34452, MOG_MOUSE, B39371 and 
P_R71360. 

35 
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EXAMPLE 93: Isolation of cDNA clones Encoding Human PRO 11 OS 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program tt phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56430. 

In light of an observed sequence homology between the DNA56430 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1853047, the Incyte EST clone 1853047 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 209 and is herein designated as DNA59612-1466. 

The entire nucleotide sequence of DNA59612-1466 is shown in Figure 209 (SEQ ID NO:292). Clone 
DNA59612-1466 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 28-30 and ending at the stop codon at nucleotide positions 568-570 of SEQ ID NO:292 (Figure 209). 
The predicted polypeptide precursor is 180 amino acids long (Figure 210). The full-length PRO1105 protein 
shown in Figure 210 has an estimated molecular weight of about 20,040 daltons and a pi of about 8.35. Clone 
DNA59612-1466 has been deposited with the ATCC on June 9, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 210, a signal peptide is at about amino acids 1-19 of SEQ ID NO:293 and 
transmembrane domains are shown at about amino acids 80-99 and 145-162 of SEQ ID NO:293. It is 
understood that the skilled artisan could form a polypeptide with all of or any combination or individual selection 
of these regions. It is also understood that the corresponding nucleic acids can be routinely identified and 
prepared based on the information provided herein. 

EXAMPLE 94: Isolation of cDNA clones Encoding Human PRQ51 1 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
EnzYmolosY 266:460-480 ( 1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the pr gram "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56434. 
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In light f an observed sequence homology between the DNA56434 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1227491, the Incyte EST clone 1227491 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 21 1 and is herein designated as DNA59613-14I7. 

The entire nucleotide sequence of DNA59613-14I7 is shown in Figure 211 (SEQ ID NO:294). Clone 
DNA59613-1417 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 233-235 and ending at the stop codon at nucleotide positions 944-946 (Figure 211). The predicted 
polypeptide precursor is 237 amino acids long (Figure 212). The full-length PR051 1 protein shown in Figure 
212 has an estimated molecular weight of about 25,284 daltons and a pi of about 5.74. Clone DNA59613-1417 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:295, the putative signal peptide is at about amino 
acids 1-25 of SEQ ID NO:295. The N-glycosylation sites are at about amino acids 45-48, 73-76, 107-1 10, 1 18- 
121, 132-135, 172-175, 175-178 and 185-188 of SEQ ID NO:295. An arthropod defensins conserved region 
is at about amino acids 176-182 of SEQ ID NO:295. A kringle domain begins at about amino acid 128 of SEQ 
ID NO:295 and a ly-6/u-PAR domain begins at about amino acid 6 of SEQ ID NO:295. The corresponding 
nucleotides of these amino acid sequences and others can be routinely determined given the sequences provided 
herein. 

The designations appearing in a Dayhoff database with which PR051 1 has some sequence identity are 
as follows: SSC20F10_1; SF041083; PJV26579; S44208; JC2394; PSTA DICDI; A27020; S59310; 
RAG1 RABIT; and MUSBALBC1J. 

EXAMPLE 95 : Isolation of cDNA clones Encoding Human PRQ11Q4 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
EnzvmoloKV 266:460^80 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56446. 

In light of an observed sequence homology between the DNA56446 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2837496, the Incyte EST clone 2837496 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 213 and is herein designated as DNA59616-1465. 

The entire nucleotide sequence of DNA59616-1465 is shown in Figure 213 (SEQ ID NO:296). Clone 
DNA59616-1465 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 109-1 1 1 and ending at the stop codon at nucleotide positions 1 132-1 134 of SEQ ID NO:296 (Figure 
213). The predicted polypeptide precursor is 341 amino acids long (Figure 214). The full-length PRO1104 
protein shown in Figure 214 has an estimated molecular weight of about 36,769 daltons and a pi of about 9.03. 
Clone DNA59616-1465 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
5 sequencing techniques. 

Analyzing Figure 214, a signal peptide is at about amino acids 1-22 of SEQ ID NO:297. N- 
myristoylation sites are at about amino acids 41-46, 110-115, 133-138, 167-172 and 179-184 of SEQ ID 
NO:297. 

10 EXAMPLE 96: Isolation of cD NA clones Encoding Human PRQ1 100 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
15 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the obtained consensus sequence and an EST 
20 sequence encompassed within the Incyte EST clone no. 2305379, the Incyte EST clone 2305379 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 215 and is herein designated as DNA596I9-1464. 

The entire nucleotide sequence of DNA59619-1464 is shown in Figure 215 (SEQ ID NO:298). Clone 
DNA59619-1464 contains a single open reading frame with an apparent translational initiation site at nucleotide 
25 positions 33-35 and ending at the stop codon at nucleotide positions 993-995 of SEQ ID NO:298 (Figure 215). 
The predicted polypeptide precursor is 320 amino acids long (Figure 216). The full-length PROl 100 protein 
shown in Figure 216 has an estimated molecular weight of about 36,475 daltons and a pi of about 7.29. Clone 
DNA59619-1464 has been deposited with ATCC on July 1 , 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
30 techniques. 

Upon analyzing SEQ ID NO:299, the approximate locations of the signal peptide, the transmembrane 
domains, an N-giycosylation site, an N-myristoylation site, a CUB domain and an amiloride-sensitive sodium 
channel domain are present. It is believed that PROl 100 may function as a channel. The corresponding nucleic 
acids for these amino acids and others can be routinely determined given SEQ ID NO:299.. 

35 
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EXAMPLE 97 : Isolation of cDNA clones Encoding Human PRQ836 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was men compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained is herein designated DNA56453. 

10 In light of an observed sequence homology between the DNA56453 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2610075, the Incyte EST clone 2610075 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 217 and is herein designated as DNA59620-1463. 

The encire nucleotide sequence of DNA59620-1463 is shown in Figure 217 (SEQ ID NO:300). Clone 

15 DNA59620- 1463 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 65-67 and ending at the stop codon at nucleotide positions 1448-1450 of SEQ ID NO:300 (Figure 217). 
The predicted polypeptide precursor is 461 amino acids long (Figure 218). The full-length PR0836 protein 
shown in Figure 218 has an estimated molecular weight of about 52,085 daltons and a pi of about 5.36. Analysis 
of the full-length PR0836 sequence shown in Figure 218 (SEQ ID NO:301) evidences the presence of the 

20 following: a signal peptide, N-glycosylation sites, N-myristoylation sites, a domain conserved in the 
YJLI26w/YLR351c/yhcX family of proteins, and a region having sequence identity with SLS1. Clone 
DNA59620-1463 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

25 Analysis of the amino acid sequence of the full-length PR0836 polypeptide suggests that it possesses 

some sequence similarity to SLS1, thereby indicating that PR0836 may be involved in protein translocation of 
the ER. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some 
homology between the PR0836 amino acid sequence and at least the following Dayhoff sequences, S58132, 
SPBC3B91, S66714, CRU40057 1 and IMACAEEL. 

30 

EXAMPLE 98 : Isolation of cDNA clones Encoding Human PRO! 141 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 11873. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
35 and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
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in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56518. 

In light of an observed sequence homology between the DNA56518 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2679995, the Incyte EST clone 2679995 was purchased 
5 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 219 and is herein designated as DNA59625-1498. 

Clone DNA59625-1498 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 204-206 and ending at the stop codon at nucleotide positions 945-947 (Figure 219). 
The predicted polypeptide precursor is 247 amino acids long (Figure 220). The full-length PROl 141 protein 
10 shown in Figure 220 has an estimated molecular weight of about 26,840 daltons and a pi of about 8.19. Analysis 
of the full-length PROl 141 sequence shown in Figure 220 (SEQ ID NO:303) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19 and transmembrane domains from 
about amino acid 38 to about amino acid 57, from about amino acid 67 to about amino acid 83, from about 
amino acid 117 to about amino acid 139 and from about amino acid 153 to about amino acid 170. Clone 
15 DNA59625-1498 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit no. 209992. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 220 (SEQ ID NO:303), evidenced significant 
homology between the PROl 141 amino acid sequence and the following Dayhoff sequences: CEVF36H2L 2, 
PCRB7PRJ 1, AB000506J, LEU95008J, MRU87980J5, YIGMECOLI, STU65700J, GHU62778J, 
20 CYST_SYNY3 and AF009567 1 . 

EXAMPLE 99 : Isolation of cDNA clones Encoding Human PROl 132 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA35934. Based on the DNA35934 
25 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PROl 132. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : S'-TCCTGTGACCACCCCTCTAACACC^ (SEQ ID NO:310) and 
30 reverse PCR primer : 5*-CTGGAACATCTGCTGCCCAGATTC-3' (SEQ ID NO:31 1). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
sequence which had the following nucleotide sequence: 

5 '-GTCGGATGAC AGCAGC AGCCGC ATC ATC A ATGG ATCCG ACTGCGATATGC-3 ' (SEQ ID NO:312). 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 132 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 132 and the derived protein sequence for PROl 132. 

The entire nucleotide sequence of PROl 132 is shown in Figure 225 (SEQ ID NO:308). Clone 
DNA59767-1489 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 354-356 and a stop codon at nucleotide positions 1233-1235 (Figure 225; SEQ ID NO:308). The 
5 predicted polypeptide precursor is 293 amino acids long. The signal peptide is at about amino acids 1-22 and 
the histidine active site is at about amino acids 104-109 of SEQ ID NO:309. Clone DNA59767-1489 has been 
deposited with ATCC (having the actual sequence rather than representations based on sequencing techniques 
as presented herein) and is assigned ATCC deposit no. 203108. The full-length PROl 132 protein shown in 
Figure 226 has an estimated molecular weight of about 32,020 daltons and a pi of about 8.7. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 226 (SEQ ID NO:309), revealed sequence identity 
between the PROl 132 amino acid sequence and the following Dayhoff sequences: SSU76256_1, PW10694, 
MMAE000663_6, AF013988J, U66061J, MMAE000665_2, MMAE00066415, MMAE00066414, 
MMAE000665_4 and MMAE00066412. 

15 

EXAMPLE 100 : Isolation of cDNA clones Encoding Human NL7 (PRO 1346) 

A single EST sequence (#1398422) was found in the LIFESEQ* database as described in Example 1 
above. This EST sequence was renamed as DNA45668. Based on the DNA45668 sequence, oligonucleotides 
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use 
20 as probes to isolate a clone of the full-length coding sequence for NL7. 
PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5 '-CAC ACGTCCAACCTC A ATGGGC AG-3 ' (SEQ ID NO:315) 
reverse PCR primer : 5'-GACCAGCAGGGCCAAGGACAAGG-3' (SEQ ID NO:316) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
25 DNA45668 sequence which had the following nucleotide sequence: 
hybridization probe: 

5 ' -GTTCTCTG AG ATG AAG ATCCGGCCGGTCCGGG AGTACCGCTT AG-3 ' 
(SEQIDNO:317) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
30 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the NL7 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from a human fetal kidney library (LIB227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for NL7 
(designated herein as DNA59776-1600 [Figure 227, SEQ ID NO:3 13]) and the derived protein sequence for NL7 
35 (PR01346). 

The entire coding sequence of NL7 (PR01346) is shown in Figure 227 (SEQ ID NO:313). Clone 
DNA59776- 1600 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 1-3 and an apparent stop codon at nucleotide positions 1384-1386. The predicted polypeptide precursor 
is 461 amino acids long. The protein contains an apparent type II transmembrane domain at amino acid positions 
from about 31 t about 50; fibrinogen beta and gamma chains C-terminal domain signature starting at about 
amino acid position 409, and a leucine zipper pattern starting at about amino acid positions 140, 147, 154 and 
161, respectively. Clone DNA59776-1600 has been deposited with ATCC and is assigned ATCC deposit no. 
5 203128. The full-length NL7 protein shown in Figure 228 has an estimated molecular weight of about 50,744 
daltons and a pi of about 6.38. 

Based on a WU-BLAST2 sequence alignment analysis (using the WU-BLAST2 computer program) of 
the full-length sequence, NL7 shows significant amino acid sequence identity to a human microfibril-associated 
glycoprotein (1 MFA4_HUMAN); to known TIE-2 ligands and ligand homologues, ficolin, serum lectin and 
10 TGF-1 binding protein. 

EXAMPLE 101: Isolation of cDNA clones Encoding Human PRO 1131 

A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated 
DNA43546 (see Figure 231; SEQ ID NO:320). The DNA43546 sequence was then compared to a variety of 

15 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with 

20 the program "phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA45627. 

Based on the DNA45627 sequence, oligonucleotide probes were generated and used to screen a human 
library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is 
a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science 253:1278-1280 (1991)), 

25 and the cDNA size cut was less than 2800 bp. 

PCR primers (forward and 2 reverse) were synthesized: 
forward PCR primer 5 , -ATGCAGGCCAAGTACAGCAGCAC-3 l (SEQ ID NO:321); 
reverse PCR primer 1 5 ' -C ATGCTG ACG ACTTCCTGC A AGC-3 * (SEQ ID NO:322); and 
reverse PCR primer 1 5'-CCACACAGTCTCTGCTTCTTGGG-3' (SEQ ID NO:323) 

30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45627 

sequence which had the following nucleotide sequence: 
hybridization probe 

5'-ATGCTGGATGATGATGGGGACACCACCATGAGCCTGCATT-3' (SEQ ID NO:324). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 131 gene using the probe oligonucleotide and one of the PCR primers. 
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A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 144-146, and a stop signal at nucleotide positions 984-986 
(Figure 229; SEQ ID NO:318). The predicted polypeptide precursor is 280 amino acids long, has a calculated 
molecular weight of approximately 31,966 daltons and an estimated pi f approximately 6.26. The 
transmembrane domain sequence is at about 49-74 of SEQ ID NO:319 and the region having sequence identity 
5 with LDL receptors is about 50-265 of SEQ ID NO:319. PROl 131 contains potential N-linked glycosylation 
sites at amino acid positions 95-98 and 169-172 of SEQ ID NO:319. Clone DNA59777-1480 has been deposited 
with the ATCC and is assigned ATCC deposit no. 2031 1 1 . 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in.Figure 230 (SEQ ID NO:319), evidenced some sequence 
10 identity between the PRO 1131 amino acid sequence and the following Dayhoff sequences: AB0107101, 149053, 
149115, RNU56863 1, LY4A_MOUSE, 155686, MMU56404 1, 149361, AF030313J and MMU09739J. 

EXAMPLE 102 : Isolation of cDNA clones Encoding Human PRO 1281 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein as DNA35720. Based on the DNA35720 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 1281 . 

PCR primers (forward and reverse) were synthesized: 
forward PCR primers : 
20 5'-TGGAAGGCTGCCGCAACGACAATC-3 ' (SEQ ID NO:327); 
5'-CTG ATGTGGCCG ATGTTCTG-3 ' (SEQ ID NO:328); and 
5'-ATGGCTCAGTGTGCAGACAG-3' (SEQ ID NO:329). 
reverse PCR primers : 

5 '-GC ATGCTGCTCCGTG A AGTAGTCC-3 ' (SEQ ID NO:330); and 
25 5 '-ATGCATGGGAAAG AAGGCCTGCCC-3 ' (SEQ ID NO:331). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA35720 sequence 
which had the following nucleotide sequence: 
hybridization probe: 

5 '-TGCACTGGTGACCACGAGGGGGTGC ACTATAGCCATCTGG AGCTG AG-3 ' (SEQ ID NO:332). 
30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 

to isolate clones encoding the PRO 1281 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
35 PR01281 (designated herein as DNA59820-1549 [Figure 232, SEQ ID NO:325]; and the derived protein 

sequence for PR01281 . 



459 



WO 99/63088 



PCT/US99/12252 



The entire coding sequence of PR01281 is shown in Figure 232 (SEQ ID NO:325). Clone DNA59820- 
1549 contains a single pen reading frame with an apparent transl ational initiation site at nucleotide positions 
228-230 and an apparent stop codon at nucleotide positions 2553-2555. The predicted polypeptide precursor 
is 775 amino acids long. The full-length PR01281 protein shown in Figure 233 has an estimated molecular 
weight of about 85,481 daltons and a pi of about 6.92. Additional features include a signal peptide at about 
5 amino acids 1-15; and potential N-glycosylation sites at about amino acids 138-141 and 361-364. 

An analysis of me Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 233 (SEQ ID NO:326), revealed some sequence 
identity between the PR01281 amino acid sequence and the following Dayhoff sequences: S44860, CET24D11 , 
CEC38H2J, CAC2 HAECO, B3A2_HUMAN, S22373, CEF38A3_2, CEC34F6_2, CEC34F63, and 
10 CELT22B11_3. 

Clone DNA59820-1549 has been deposited with ATCC and is assigned ATCC deposit no. 203129. 

EXAMPLE 103 : Isolation of cDNA clones Encoding Human PRO1064 

A cDNA sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 

15 BLAST2 sequence alignment computer program, to have no significant sequence identity to any known human 
protein. This cDNA sequence is herein designated DNA45288. The DNA45288 sequence was then compared 
to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST database 
(LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The comparison 
was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzvmology. 

20 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus sequence is 
herein designated DNA48609. Oligonucleotide primers based upon the DNA48609 sequence were then 
synthesized and employed to screen a human fetal kidney cDNA library which resulted in the identification of 

25 the DNA59827-1426 clone shown in Figure 234. The cloning vector was pRK5B (pRK5B is a precursor of 
pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 253 : 1278-1280 (1991)), and the cDNA 
size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PCR primer 5 , -CTGAGACCCTGCAGCACCATCTG-3 , (SEQ ID NO:336) 

30 reverse PCR primer 5'-GGTGCTTCTTGAGCCCCACTTAGC-3 , (SEP ID NO:337) ' 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA48609 sequence which had the following nucleotide sequence 
hybridization probe 

5*-AATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGT-3' (SEQ ID NO:338) 
35 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 532-534 and a stop signal at nucleotide positions 991-993 
(Figure 234, SEQ ID N0.333). The predicted polypeptide precursor is 153 amino acids long, has a calculated 
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molecular weight of approximately 17,317 daltons and an estimated pi of approximately 5.17. Analysis of the 
full-length PRO1064 sequence shown in Figure 235 (SEQ ID NO:334) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino 
acid 89 to about amino acid 1 10, an indole-3-glycerol phosphate synthase homology block from about amino acid 
74 to about amino acid 105 and a Myb DNA binding domain protein repeat protein homology block from about 
5 amino acid 1 14 to about amino acid 137. Clone DNA59827-1426 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203089. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 235 (SEQ ID NO:334), evidenced homology 
between the PRO1064 amino acid sequence and the following Dayhoff sequences: MMNP15PRCM, 
10 BP187PLYH_l,CELF42G8_4,MMU58888 > _l f GEN14270,TUB8_SOLTU,RCN - MOUSE,HUMRBSY79_l, 
SESENODA J and A21467J . 

EXAMPLE 104 : Isolation of cDNA clones Encoding Human PRO 1379 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein DNA45232. Based on the DNA45232 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1379. 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer 5 ' -TGG AC ACCGTACCCTGGTATCTGC-3 ' (SEQ ID NO:341) 
reverse PCR primer 5'-CCAACTCTGAGGAGAGCAAGTGGC-3* (SEQ ID NO:342) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45232 sequence which had the following nucleotide sequence: 
hybridization probe 

25 5'-TGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAAC-3' (SEQ ID NO:343). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PR01379 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated human fetal kidney tissue. 
30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01379 which is designated herein as DNA59828-1608 and shown in Figure 237 (SEQ ID NO:339); and the 

derived protein sequence for PR01379 (SEQ ID NO:340). 

The entire coding sequence of PR01379 is shown in Figure 237 (SEQ ID NO:339). Clone DNA59828- 

1608 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
35 10-12 and an apparent stop codon at nucleotide positions 1732-1734. The prechcted polypeptide precursor is 574 

amino acids long. The full-length PRO 1379 protein shown in Figure 238 has an estimated molecular weight of 

about 65,355 daltons and a pi of about 8.73. Additional fearures include a signal peptide at about amino acids 
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1-17 and potential N-glycosyiation sites at about amino acids 160-163, 287-290, and 323-326. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 238 (SEQ ID NO: 340), revealed some homology 
between the PRO 1379 amino acid sequence and the following Dayhoff sequences: YHY8YEAST, AF040625 1, 
HP714394J, and HIV18U45630J. 
5 Clone DNA59828-1608 has been deposited with ATCC and is assigned ATCC deposit no. 203158. 

EXAMPLE 105 : Isolation of cDNA Clones Encoding Human PRQ844 

An expressed sequence tag (EST) DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed sequence identity with aLP. Based on the information 
10 and discoveries provided herein, the clone for this EST, Incyte clone no. 2657496 from a cancerous lung library 
was further examined. 

DNA sequencing of the insert for this clone gave a sequence (herein designated as DNA59838-1462; 
SEQ ID NO:344) which includes the full-length DNA sequence for PR0844 and the derived protein sequence 
for PR0844. 

15 The entire nucleotide sequence of DNA59838-1462 is shown in Figure 239 (SEQ ID NO:344). Clone 

DNA59838-1462 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 5-7 and ending at the stop codon at nucleotide positions 338-340 of SEQ ID NO:344 (Figure 239). 
The predicted polypeptide precursor is 111 amino acids long (Figure 240). The full-length PR0844 protein 
shown in Figure 240 has an estimated molecular weight of about 12,050 daltons and a pi of about 5.45. Clone 

20 UNQ544 DNA59838-1462 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0844 polypeptide suggests that it possesses 
significant sequence similarity to serine protease inhibitors, thereby indicating that PR0844 may be a novel 
25 proteinase inhibitor. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0844 amino acid sequence and at least the following Dayhoff 
sequences, ALK1_HUMAN, PJ>82403, P_P82402, ELAF_HUMAN and P_P60950. 

EXAMPLE 106 : Isolation of cDNA Clones Encoding Human PRQ848 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

35 Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA55999. 

In light of an observed sequence homology between the DNA55999 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 276857 1 , the Incyte EST clone 2768571 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 241 and is herein designated as DNA59839-1461. 
5 The entire nucleotide sequence of DNA59839-1461 is shown in Figure 241 (SEQ ID NO:346). Clone 

DNA59839-1461 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 146-148 and ending at the stop codon at nucleotide positions 1946-1948 of SEQ ID NO:346 (Figure 
241). The predicted polypeptide precursor is 600 amino acids long (Figure 242). The full-length PR0848 
protein shown in Figure 242 has an estimated molecular weight of about 68,536 daltons. Clone DNA59839- 1461 
10 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone has the actual nucleic 
acid sequence and that the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0848 polypeptide suggests that it may be a 
novel sialyltransf erase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0848 amino acid sequence and at least the following Dayhoff 
15 sequences, P_R78619 (GalNAc-alpha-2, 6-sialyltransferase), CAAG5 CHICK (alpha-n-acetylgalactosamide 
. alpha-2,6-sialytransferase),HSU14550_l,CAG6_HUMANand PR632 17 (human alpha-2, 3-sialyltransferase). 

EXAMPLE 107 : Isolation of cDNA Clones Encodine Human PRO1097 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

20 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Aitshul et al., Methods in 
Enzvmology 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

25 or greater mat did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56006. 

In light of an observed sequence homology between the DNA56006 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2408105, the Incyte EST clone 2408105 was purchased 

30 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 243 and is herein designated as DNA5984 1-1460. 

The entire nucleotide sequence of DNA59841-1460 is shown in Figure 243 (SEQ ID NO:348). Clone 
DNA5984 1-1460 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 3-5 and ending at the stop codon at nucleotide positions 276-278 of SEQ ID NO: 348 (Figure 243). 

35 The predicted polypeptide precursor is 91 amino acids long (Figure 244). The full-length PRO 1097 protein 
shown in Figure 244 has an estimated molecular weight of about 10,542 daltons and a pi of about 10.04. Clone 
DN A5984 1 - 1 460 has been deposited with ATCC on July 1 , 1 998 . It is understood that the deposited clone has 
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the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 244, the signal peptide is at about amino acids 1-20 of SEQ ID NO:349. The 
glycoprotease family protein domain starts at about amino acid 56, and the acyltransferase ChoActase/COT/CPT 
family peptide starts at about amino acid 49 of SEQ ID NO:349. 

5 

EXAMPLE 108 : Isolation of cDNA clones Encoding Human PRQ1153 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

10 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap n (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA56008. 

In light of an observed sequence homology between the DNA56008 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2472409, the Incyte EST clone 2472409 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 245 and is herein designated as DNA59842-1502. 

20 The full length clone shown in Figure 245 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 92-94 and ending at the stop codon found at nucleotide positions 
683-685 (Figure 245; SEQ ID NO:350). The predicted polypeptide precursor (Figure 246, SEQ ID NO:351) 
is 197 amino acids long. PROl 153 has a calculated molecular weight of approximately 2 1 ,540 daltons and an 
estimated pi of approximately 8.31. Clone DNA59842-1502 has been deposited with ATCC and is assigned 

25 ATCC deposit no. 209982. It is understood that the correct and actual sequence is in the deposited clone while 
herein are present representations based on current sequencing techniques which may have minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PROl 153 shows some amino acid sequence identity to the following Dayhoff designations: 
S57447; SOYHRGPCJ; S46965; PJ>82971; VCPHEROPHJ; EXTNTOBAC; MLCB2548_9; 

30 ANXA_RABIT; JC5437 and SSGP_VOLCA. 

EXAMPLE 109 : Isolation of cDNA clones Encoding Human PROl 154 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
35 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
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Enzvmoiogy 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56025. 

In light of an observed sequence homology between the DNA56025 consensus sequence and an EST 
5 sequence encompassed within the Incyte EST clone no. 2169375, the Incyte EST clone 2169375 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 247 and is herein designated as DNA59846-1503. 

The full length clone shown in Figure 247 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86-88 and ending at the stop codon found at nucleotide positions 
10 2909-291 1 (Figure 247; SEQ ID NO:352). The predicted polypeptide precursor (Figure 248, SEQ ID NO:353) 
is 941 amino acids long. PROl 154 has a calculated molecular weight of approximately 107,144 daltons and an 
estimated pi of approximately 6.26. Clone DNA59846-1503 has been deposited with ATCC and is assigned 
ATCC deposit no. 209978. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
15 length sequence, PROl 154 shows sequence identity to at least the following Dayhoff designations: AB01 1097 1 , 
AMPN_HUMAN, RNU76997 1, 159331, GEN 14047, HSU62768J,P_R5 1281, CET07F1 0J, SSU66371 J, 
and AMPRE HUMAN. 

EXAMPLE 1 10: Isolation of cDNA clones Encoding Human PRO l 181 

20 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database, designated herein as 82468. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alio, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 

25 (Altshul et ah, Methods in Enzvmoiogy 266:460^80 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56029. 

In light of an observed sequence homology between the DNA56029 consensus sequence and an EST 

30 sequence encompassed within the Incyte EST clone no. 2186536, the Incyte EST clone 2186536 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 249 and is herein designated as DNA59847-151 1 . 

Clone DNA59847-1511 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 17-19 and ending at the stop codon at nucleotide positions 1328-1330 (Figure 249). 

35 The predicted polypeptide precursor is 437 amino acids long (Figure 250). The full-length PROl 181 protein 
shown in Figure 250 has an estimated molecular weight of about 46,363 daltons and a pi of about 6.22. Analysis 
of the full-length PROl 181 sequence shown in Figure 250 (SEQ ID NO:355) evidences the presence of the 
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following: a signal peptide from about amino acid 1 to about amino acid 15, potential N-glycosylation sites from 
about amino acid 46 to about amino acid 49, from about amino acid 189 to about amino acid 192 and from about 
amino acid 382 to about amino acid 385 and amino acid sequence blocks having homology to Ly-6/u-PAR 
domain proteins from about amino acid 287 to about amino acid 300 and from about amino acid 98 to about 
amino acid 111. Clone DNA59847-1511 has been deposited with ATCC on August 4, 1998 and is assigned 
5 ATCC deposit no. 203098. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 250 (SEQ ID NO:355), evidenced homology 
between the PROH81 amino acid sequence and the following Dayhoff sequences: AF041083_1, PW26579, 
RNMAGPIAN1, CELT13C2_2, LMSAP2GNJ, S61882, CEF35C512, DP87 DICDI, GIU47631J and 
10 P_R07092. 

EXAMPLE 111 : Isolation of cDNA clones Encoding Human PRO 1 182 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 146647. This EST cluster sequence was 

15 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

20 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56033. 

In light of an observed sequence homology between the DNA56033 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2595195, the Incyte EST clone 2595195 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 251 and is herein designated as DNA59848-1512. 

Clone DNA59848-1512 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 67-69 and ending at the stop codon at nucleotide positions 880-882 (Figure 25 1 ). The 
predicted polypeptide precursor is 271 amino acids long (Figure 252). The full-length PROl 182 protein shown 
in Figure 252 has an estimated molecular weight of about 28,665 daltons and a pi of about 5.33. Analysis of 

30 the full-length PROl 182 sequence shown in Figure 252 (SEQ ID NO:357) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25, an amino acid block having 
homology to C-type lectin domain proteins from about amino acid 247 to about amino acid 256 and an amino 
acid sequence block having homology to Clq domain proteins from about amino acid 44 to about amino acid 
77. Clone DNA59848-1512 has been deposited with ATCC on August 4, 1998 and is assigned ATCC deposit 

35 no. 203088. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 252 (SEQ ID NO:357), evidenced significant 
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homology between the PROl 182 amino acid sequence and the following Dayhoff sequences: PSPD BOVIN, 
CL43_BOVIN, CONG_BOVIN, PW18780, P_R45005, P_R53257 and CELEGAP7 1. 

EXAMPLE 112 : Isolation of cDNA clones Encoding Human PROl 155 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56102. 

In light of an observed sequence homology between the DNA56102 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2858870, the Incyte EST clone 2858870 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 253 and is herein designated as DNA59849-1504. 

The full length clone shown in Figure 253 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 158-160 and ending at the stop codon found at nucleotide 
positions 563-565 (Figure 253; SEQ ID NO:358). The predicted polypeptide precursor (Figure 254, SEQ ID 
NO:359) is 135 amino acids long. PROl 155 has a calculated molecular weight of approximately 14,833 daltons 
and an estimated pi of approximately 9.78. Clone DNA59849-1504 has been deposited with ATCC and is 
assigned ATCC deposit no. 209986. It is understood that the actual clone has the correct sequence whereas 
herein are only representations which are prone to minor sequencing errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PROl 155 shows some amino acid sequence identity with the following Dayhoff designations: 
TKNK_BOVIN; PVB19X587J; AF019049J; P W00948; S72864; P_W00949; 162742; AF038501J; 
TKNG_HUMAN; and YATl_RHOBL. Based on the information provided herein, PROl 155 may play a role 
in providing neuroprotection and cognitive enhancement. 

EXAMPLE 113 : Isolation of cDNA clones Encoding Human PROl 156 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 138851. This EST cluster sequence was 
then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater mat did not encode known proteins were clustered and 
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assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56261. 

In light of an observed sequence homology between the DNA56261 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3675191, the Incyte EST clone 3675191 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
5 The sequence of this cDNA insert is shown in Figure 255 and is herein designated as DNA59853-1505. 

The full length clone shown in Figure 255' contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 212-214 and ending at the stop codon found at nucleotide 
positions 689-691 (Figure 255; SEQ ID NO:360). The predicted polypeptide precursor (Figure 256, SEQ ID 
NO:361) is 159 amino acids long. PROl 156 has a calculated molecular weight of approximately 17,476 daltons, 
10 an estimated pi of approximately 9.15, a signal peptide sequence at about amino acids 1 to about 22, and 
potential N-glycosylation sites at about amino acids 27-30 and 41-44. 

Clone DNA59853-1505 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209985. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
15 alignment analysis (using the ALIGN computer program) of the full-length sequence shown in Figure 256 (SEQ 
ID NO:361), revealed some homology between the PROl 156 amino acid sequence and the following Dayhoff 
sequences: D45027 1, P_R799!4, JC5309, KBF2HUMAN, AF010144 1, GEN14351, S68681, P R79915, 
ZMTAC 3, and HUMCPGOJ. 

20 EXAMPLE 114 : Isolation of cDNA Clones Encoding Human PRO 1098 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

25 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program w phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56377. 

30 In light of an observed sequence homology between the DNA56377 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3050917, the Incyte EST clone 3050917 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 257 and is herein designated as DNA59854-1459. 

The entire nucleotide sequence of DNA59854-1459 is shown in Figure 257 (SEQ ID NO:362). Clone 

35 DNA59854-1459 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 58-60 and ending at the stop codon at nucleotide positions 292-294 of SEQ ID NO:362 (Figure 257). 
The predicted polypeptide precursor is 78 amino acids long (Figure 258). The full-length PRO 1098 protein 
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shown in Figure 258 has an estimated molecular weight of about 8,396 daJtons and a pi of about 7.66. Clone 
DNA59854-1459 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 258, a signal peptide appears to be at about amino acids 1-19 of SEQ ID NO:363, 
an N-glycosylation site appears to be at about amino acids 37-40 of SEQ ID NO:363, and N-myristoylation sites 
appear to be at about 15-20, 19-24 and 60-65 of SEQ ID NO:363. 

EXAMPLE 115 : Isolation of cDNA clones Encoding Human PROl 127 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57959. 

In light of an observed sequence homology between the DNA57959 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 685126, the Merck EST clone 685126 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 259 and is herein designated as DNA60283-1484. 

The full length clone shown in Figure 259 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 126-128 and ending at the stop codon found at nucleotide 
positions 327-329 (Figure 259; SEQ ID NO:364). The predicted polypeptide precursor (Figure 260, SEQ ID 
NO:365) is 67 amino acids long including a signal peptide at about 1-29 of SEQ ID NO:365. PROl 127 has a 
calculated molecular weight of approximately 7,528 daltons and an estimated pi of approximately 4.95. Clone 
DNA60283-1484 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit no. 203043. 
It is understood that the deposited clone has the actual sequence, whereas representations which may have minor 
sequencing errors are presented herein. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 260 (SEQ ID NO:365), revealed some homology 
between the PROl 127 amino acid sequence and the following Dayhoff sequences: AF037218 48, PW09638, 
HBAHETPO, S39821, KR2_EBV, CET20D3J, HCU37630J, HS193B12J0, S40012 and TRITUBCJ. 



469 



WO 99/63088 



PCT/US99/12252 



EXAMPLE 116 : Isolation of cDNA clones Encoding Human PRO 1126 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program w phrap n (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56250. 

10 In light of an observed sequence homology between the DNA56250 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 1437250, the Incyte EST clone 1437250 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 261 and is herein designated as DNA60615-1483. 

Clone DNA60615-1483 contains a single open reading frame with an apparent translational initiation 

15 site at nucleotide positions 1 10- 1 12 and ending at the stopcodon at nucleotide positions 1316-1318 (Figure 261). 
The predicted polypeptide precursor is 402 amino acids long (Figure 262). The full-length PROl 126 protein 
shown in Figure 262 has an estimated molecular weight of about 45,92 1 daltons and a pi of about 8.60. Analysis 
of the full-length PRO 1126 sequence shown in Figure 262 (SEQ ID NO:367) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25 and potential N-glycosylation sites 

20 from about amino acid 66 to about amino acid 69, from about amino acid 1 38 to about amino acid 141 and from 
about amino acid 183 to about amino acid 186. Clone DNA60615-1483 has been deposited with ATCC on June 
16, 1998 and is assigned ATCC deposit no. 209980. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 262 (SEQ ID NO:367), evidenced significant 

25 homology between the PROl 126 amino acid sequence and the following Dayhoff sequences: 173636, 
NOMR_HUMAN, MMUSMYOC3J , HS454G6J, P_R98225, RNU78105J, RNU72487J, AF035301J, 
CEELC48E7_4 and CEF1 1C3JS. 

EXAMPLE 117 : Isolation of cDNA clones Encoding Human PROl 125 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

35 Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56540. 

In light of an observed sequence homology between the DNA56540 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 14861 14, the Incyte EST clone 14861 14 was purchased 
and the cDNA insert was btained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 263 and is herein designated as DNA60615-1483. 
5 The full length clone shown in Figure 263 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 47*49 and ending at the stop codon found at nucleotide positions 
1388-1390 (Figure 263; SEQ ID NO:368). The predicted polypeptide precursor (Figure 264, SEQ ID NO:369) 
is 447 amino acids long. PROl 125 has a calculated molecular weight of approximately 49,798 daltons and an 
estimated pi of approximately 9.78. Clone DNA60619-1482 has been deposited with ATCC and is assigned 

10 ATCC deposit no. 209993. It is understood that the clone has the actual sequence and that the sequences herein 
are representations based on current techniques which may be prone to minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PRO 1 125 shows some sequence identity with the following Dayhoff designations: 
RC01_NEUCR; S58306; PKWAJTHECU; S76086; P_R8588l; HETi PODAN; SPU92792 1; 

15 APAF H UM AN ; S76414 and S59317. 

EXAMPLE 118 : Isolation of cDNA clones Encoding Human PROl 186 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

20 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

25 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56748. 

In light of an observed sequence homology between the DNA56748 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3476792, the Incyte EST clone 3476792 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure 265 and is herein designated as DNA60621-1516. 

The full length clone shown in Figure 265 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 91-93 and ending at the stop codon found at nucleotide positions 
406-408 (Figure 265; SEQ ID NO:370). The predicted polypeptide precursor (Figure 266, SEQ ID NO:371) 
is 105 amino acids long. The signal peptide is at amino acids 1-19 of SEQ ID NO:371. PROl 186 has a 

35 calculated molecular weight of approximately 1 1 ,715 daltons and an estimated pi of approximately 9.05. Clone 
DNA60621-1516 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203091 . 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 266 (SEQ ID NO:371), revealed some sequence 
identity between the PR01186 amino acid sequence and the following Dayhoff sequences: VPRADENPO, 
LFE4CHICK, AF034208J, AF030433J, A55035, COLRABIT, CELB0507 9, S67826J, S34665 and 
CRU73817J. 

5 

EXAMPLE 1 19: Isolation of cDNA clones Encoding Human PRO 1198 

An initial DNA sequence referred to herein as DNA52083 was identified using a yeast screen in a 
human umbilical vein endothelial cell cDNA library that preferentially represents the 5' ends of the primary 
cDNA clones. DNA52083 was compared to ESTs from public databases (e.g., GenBank), and a proprietary 

10 EST database (UFESEQ® Incyte Pharmaceuticals, Palo Alto, CA), using the computer program BLAST or 
BLAST2 [Altschul et al., Methods in Enzvmology. 266:460-480 (1996)]. The ESTs were clustered and 
assembled into a consensus DNA sequence using the computer program tt phrap n (Phil Green, University of 
Washington, Seattle, Washington). One or more of the ESTs was obtained from human breast skin tissue 
biopsy. This consensus sequence is designated herein as DNA52780. 

15 In light of an observed sequence homology between the DNA52780 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3852910, the Incyte EST clone 3852910 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 267 and is herein designated as DNA60622-1525. 

The full length DNA60622-1525 clone shown in Figure 267 (SEQ ID NO:372) contained a single open 

20 reading frame with an apparent translational initiation site at nucleotide positions 54 to 56 and ending at the stop 
codon found at nucleotide positions 741 to 743. The predicted polypeptide precursor, which is shown in Figure 
268 (SEQ ID NO:373), is 229 amino acids long. PROl 198 has a calculated molecular weight of approximately 
25,764 daltons and an estimated pi of approximately 9. 17. There is a signal peptide sequence at about amino 
acids 1 through 34. There is sequence identity with glycosyi hydrolases family 31 protein at about amino acids 

25 142 to about 175. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 268 (SEQ ID NO:373), revealed some homology 
between the PROl 198 amino acid sequence and the following Dayhoff sequences: ATF6H116, UCRIRAT, 
TOBSUP2NTJ, RCUERF3J, AMU88186J, PJV22485, S56579, AF04071IJ, DPP4 PIG. 
30 Clone DNA60622-1525 was been deposited with the ATCC on August 4, 1998, and is assigned ATCC 

deposit no. 203090. 

EXAMPLE 120: Isolation of cDNA clones Encoding Human PROl 158 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
35 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
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homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
r greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57248. 
5 In light of an observed sequence homology between the DNA57248 consensus sequence and an EST 

sequence encompassed within the lncyte EST clone no. 2640776, the lncyte EST clone 2640776 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 269 and is herein designated as DNA60625-1507. 

The full length clone shown in Figure 269 contained a single open reading frame with an apparent 

10 translational initiation site at nucleotide positions 163 to 165 and ending at the stop codon found at nucleotide 
positions 532 to 534 (Figure 269; SEQ ID NO:374). The predicted polypeptide precursor (Figure 270, SEQ 
ID NO:375) is 123 amino acids long. PR01158 has a calculated molecular weight of approximately 13,113 
daltons and an estimated pi of approximately 8.53. Additional features include a signal peptide sequence at about 
amino acids 1-19, a transmembrane domain at about amino acids 56-80, and a potential N-glycosylation site at 

15 about amino acids 36-39. Clone DNA60625-1507 was deposited with the ATCC on June 16, 1998 and is 
assigned ATCC deposit no . 209975 . 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 270 (SEQ ID NO:375), revealed some homology 
between the PR01158 amino acid sequence and the following Dayhoff sequences: ATAC0O31O510F18A8.10, 

20 P_R85151, PHS2_SOLTU, RNMHCIBACJ, RNA1FMHC1, 168771, RNRT1A10G1, PTPAHUMAN, 
HUMGACA J , and CHKPTPA J . 

EXAMPLE 121 : Isolation of cDNA clones Encoding Human PROl 159 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

25 EST cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DN A5722 1 . 

In light of an observed sequence homology between the DNA57221 consensus sequence and an EST 
sequence encompassed within the lncyte EST clone no. 376776, the lncyte EST clone 376776 was purchased 

35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 271 and is herein designated as DNA60627-1508. 
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Clone DNA60627-1508 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 92-94 and ending at the stop codon at nucleotide positions 362-364 (Figure 27 1). The 
predicted polypeptide precursor is 90 amino acids long (Figure 272). The full-length PROl 159 protein shown 
in Figure 272 has an estimated molecular weight of about 9,840 daltons and a pi of about 10. 13. Analysis of 
the full-length PROl 159 sequence shown in Figure 272 (SEQ ID NO:377) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 15 and a potential N-glycosylation site 
from about amino acid 38 to about amino acid 41. Clone DNA60627-1508 has been deposited with ATCC on 
August 4, 1998 and is assigned ATCC deposit no. 203092. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 272 (SEQ ID NO:377), evidenced significant 
homology between the PROl 159 amino acid sequence and the following Dayhoff sequences: AF016494_6, 
AF0367O8J20, DSSCUTEJ, D89100J, S28060, MEFAJCENLA, AF020798_12,G70065,E64423,JQ2005. 



EXAMPLE 122: Isolation of cDNA clones Encoding Human PROl 124 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
15 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56035. 

In light of an observed sequence homology between the DNA56035 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2767646, the Incyte EST clone 2767646 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 273 and is herein designated as DNA60629-1481. 

The full length clone shown in Figure 273 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 25-27 and ending at the stop codon found at nucleotide positions 
2782-2784 (Figure 273; SEQ ID NO:378). The predicted polypeptide precursor (Figure 274, SEQ ID NO:379) 
is 919 amino acids long. PROl 124 has a calculated molecular weight of approximately 101 ,282 daltons and an 
estimated pi of approximately 5.37. Clone DNA60629-1481 has been deposited with the ATCC and is assigned 
ATCC deposit no. 209979. It is understood that the deposited clone has the actual sequence, whereas only 
representations based on current sequencing techniques which may include normal and minor errors, are 
provided herein. 

Based on a WU-BLAST2 sequence alignment analysis of the full-length sequence, PROl 124 shows 
significant amino acid sequence identity to a chloride channel protein and to ECAM-1. Specifically, the 
following Dayhoff designations were identified as having sequence identity with PROl 124: ECLC BOVIN, 
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AF001261J, P_W06548, SSC6A10J, AF004355J, S76691, AF017642, BYU06866_2, CSAJD1CDI and 
SAU47139_2. 

EXAMPLE 123 : Isolation of cDNA clones Encoding Human PRO 1287 

An expressed sequence lag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed homology to the fringe protein. This EST sequence was 
then compared to various EST databases including public EST databases (e.g. , GenBank), and a proprietary EST 
database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The 
comparison was performed using the computer program BLAST or BLAST2 [Altschu! et ah, Methods in 
EnzvmologY, 266:460^80 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 
90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence 
with the program w phrap w (Phil Green, University of Washington, Seattle, Washington). This consensus 
sequence obtained is herein designated DNA40568. 

Based on the DNA40568 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR 
a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full- 
length coding sequence for PR01287. Forward and reverse PCR primers generally range from 20 to 30 
nucleotides and are often designed to give a PCR product of about 100- 1000 bp in length. The probe sequences 
are typically 40-55 bp in length. In some cases, additional oligonucleotides are synthesized when the consensus 
sequence is greater than about l-1.5kbp. In order to screen several libraries for a full-length clone, DNA from 
the libraries was screened by PCR amplification, as per Ausubel et al. , Current Protocols in Molecular Biology. 
supra, with the PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest 
using the probe oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 '-CTCGGGGAAAGGGACTTG ATGTTGG-3 ' (SEQ ID NO:382) 
reverse PCR primer 1 5 ' -GCG AAGGTG AGCCTCT ATCTCGTGCC-3 ' (SEQ ID NO:383) 
reverse PCR primer 2 5 ' -C AGCCTAC ACGT ATTG AGG-3 * (SEQ ID NO:384) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40568 
sequence which had the following nucleotide sequence 
hybridization probe 

5 '-CAGTC AGTAC A ATCCTGGC ATAATATACGGCCACC ATG ATGC AGTCCC-3 * (SEQ ID NO:385). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PRO 1287 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of me cDNA libraries was isolated from human bone marrow tissue. The cDNA 
libraries used to isolated the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, CA. The cDN A was primed with oiigo dT containing a NotI 
site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
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pRK5B is a precursor of pRKSD that does not contain the Sfil site; see, Holmes et al. , Science . 253 : 1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01287 (designated herein as DNA61755-1554 [Figure 275, SEQ ID NO:380J) and the derived protein 
sequence for PR01287. 

5 The entire nucleotide sequence of DNA61755-1554 is shown in Figure 275 (SEQ ID NO:380). The 

full length clone contained a single open reading frame with an apparent translational initiation site at nucleotide 
positions 655-657 and a stop signal at nucleotide positions 2251-2253 (Figure 275, SEQ ID NO:380). The 
predicted polypeptide precursor is 532 amino acids long, has a calculated molecular weight of approximately 
61 ,351 daltons and an estimated pi of approximately 8.77. Analysis of the full-length PR01287 sequence shown 
10 in Figure 276 (SEQ ID NO:381) evidences the presence of the following: a signal peptide from about amino acid 
1 to about amino acid 27 and potential N-glycosylation sites from about amino acid 315 to about amino acid 318 
and from about amino acid 324 to about amino acid 327. Clone DNA61755-1554 has been deposited with 
ATCC on August 11, 1998 and is assigned ATCC deposit no. 203112. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
15 alignment analysis of the full-length sequence shown in Figure 276 (SEQ ID NO:38l), evidenced significant 
homology between the PR01287 amino acid sequence and the following Dayhoff sequences: CET24D1 1, 
EZRI_BOVIN, GGU19889J, CC3_YEAST, S74244, NALS_MOUSE, M0ES_P1G, S28660, S44860 and 
YNA4 CAEEL. 

20 EXAMPLE 124: Isolation of cDNA clones Encoding Human PRO 13 1? 

DNA55773 was identified in a human fetal kidney cDNA library using a yeast screen that preferentially 
represents the 5' ends of the primary cDNA clones. Based on the DNA55773 sequence, oligonucleotides were 
synthesized for use as probes to isolate a clone of the full-length coding sequence for PR01312. 

The full length DNA61873-1574 clone shown in Figure 277 (SEQ ID NO:386) contained a single open 
25 reading frame with an apparent translational initiation site at nucleotide positions 7-9 and ending at the stop 
codon found at nucleotide positions 643-645. The predicted polypeptide precursor is 212 amino acids long 
(Figure 278, SEQ ID NO:387). PR01312 has a calculated molecular weight of approximately 24,024 daltons 
and an estimated pi of approximately 6.26. Other features include a signal peptide at about amino acids 1-14; 
a transmembrane domain at about amino acids 141-160, and potential N-glycosylation sites at about amino acids 
30 76-79 and 93-96. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 278 (SEQ ID NO:387), revealed some homology 
between the PRO 13 12 amino acid sequence and the following Dayhoff sequences: GCINTALPH 1, 
GIBMUC1AJ, P_R96298, AF001406J, PVU88874J, P_R85151, AF041409J, CELC50F2_7, C45875, 
35 and AB009510 21. 

Clone DNA61873-1574 has been deposited with ATCC and is assigned ATCC deposit no. 203132. 
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EXAMPLE 125: Isolation of c DNA clones Encoding Human PROl 192 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein DNA35924. Based on the DNA35924 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
5 PROl 192. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer: 5 ' -CCG AGGCC ATCTAGAGGCC AG AGC-3 ' (SEQ ID NO:390) 
reverse PCR primer: 5 ' -AC AGGCAGAGCCAATGGCCAG AGC-3 ' (SEQ ID NO:391). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
10 DNA35924 sequence which had the following nucleotide sequence: 
hybridization probe: 

5'-GAGAGGACTGCGGGAGTTTGGGACCTTTGTGCAGACGTGCTCATG-3' (SEQ ID NO:392). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identif.ed above. A positive library was then used to 
15 isolate clones encoding the PROl 192 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal liver and spleen tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01192 designated herein as DNA62814-1521 and shown in Figure 279 (SEQ ID NO:388); and the derived 
protein sequence for PROl 192 which is shown in Figure 280 (SEQ ID NO:389). 
20 The entire coding sequence of PROl 192 is shown in Figure 279 (SEQ ID NO:388). Clone DNA62814- 

1521 contains a single open reading frame with an apparent translations initiation site at nucleotide positions 
121-123 and an apparent stop codon at nucleotide positions 766-768. The predicied polypeptide precursor is 215 
amino acids long. The predicted polypeptide precursor has the following features: a signal peptide at about 
amino acids 1-21; a transmembrane domain at about amino acids 153-176; potential N-glycosylation sites at 
25 about amino acids 39-42 and 1 18-121 ; and homology with myelin P0 proteins at about amino acids 27-68 and 
99-128 of Figure 280. The full-length PROl 192 protein shown in Figure 280 has an estimated molecular weight 
of about 24,484 daltons and a pi of about 6.98. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 280 (SEQ ID NO:389), revealed homology 
between the PROH92 amino acid sequence and the followingDayhoff sequences: GEN12838,MYP0_HUMAN. 
AF049498J, GEN14531, P_W14146, HS46KDAJ, CINB RAT, OX2G_RAT, D87018J, and D86996_2. 

Clone DNA62814-1521 was deposited with the ATCC on August 4, 1998, and is assigned ATCC 
deposit no. 203093. 

35 
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EXAMPLE 126: Isolation of cDNA clones Encoding Human PRO! Ifif) 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above This consensus sequence is herein designated DNA40650. Based on the DNA40650 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
5 PRO1160. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer S'-GCTCCCTGATCTTCATGTCACCACCO- (SEQ ID NO:395) 
reverse PCR primer S'-CACjGGACACACTCTACCATTCGGGAG-S' (SEQ ID NO:396) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40650 
10 sequence which had the following nucleotide sequence 
hybridization probe 

5 ' -CC ATCTTTCTGGTCTCTGCCC AG AATCCG AC AACAGCTGCTC-3 ' (SEQ ID NO:397) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
15 isolate clones encoding the PROl 160 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human breast tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 160 (designated herein as DNA62872-1509 [Figure 281. SEQ ID NO: 393]) and the derived protein 
sequence for PROl 160. 

20 The entire nucleotide sequence of DNA62872-1509 is shown in Figure 281 (SEQ ID NO:393). Clone 

DNA62872-1509 contains a single open reading frame with an apparent translations initiation site at nucleotide 
positions 40-42 and ending at the stop codon at nucleotide positions 310-312 (Figure 281). The predicted 
polypeptide precursor is 90 amino acids long (Figure 282). The full-length PROl 160 protein shown in Figure 
282 has an estimated molecular weight of about 9,039 daltons and a pi of about 4.37. Analysis of the full-length 
25 PROl 160 sequence shown in Figure 282 (SEQ ID NO:394) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 19 and a protein kinase C phosphorylation site from about 
amino acid 68 to about amino acid 70. Clone DNA62872-1509 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203100. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
30 alignment analysis of the full-length sequence shown in Figure 282 (SEQ ID NO: 394), evidenced significant 
homology between the PROl 160 amino acid sequence and the following Dayhoff sequences: B30305, 
GEN13490, 153641, S53363, HA34 BRELC, SP96_DICDI, S36326, SSU51197_10, MUC1_XENLA, 
TCU32448J and AF000409J. 

35 EXAMPLE 127: Isolation of cDNA clones Encoding Human PROl 187 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
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expressed sequence lag (EST) databases which included public EST databases (e.g. . GenBank) and a proprietary 
EST DNA database (LIFESEQ®, lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods ir 
EnzymoloftY 26 6:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
5 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57726. 

In light of an observed sequence homology between the DNA57726 consensus sequence and an EST 
sequence encompassed within the lncyte EST clone no. 358563, the lncyte EST clone 358563 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
10 The sequence of this cDNA insert is shown in Figure 283 and is herein designated as DNA62876-1517. 

The full length clone shown in Figure 283 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 121-123 and ending at the stop codon found at nucleotide 
positions 481-483 (Figure 283; SEQ ID NO:398). The predicted polypeptide precursor (Figure 284, SEQ ID 
NO:399) is 120 amino acids long. The signal peptide is at about amino acids 1-17 of SEQ ID NO:399. 
15 PROH87 has a calculated molecular weight of approximately 12,925 daltons and an estimated pi of 
approximately 9.46. Clone DNA62876-1517 was deposited with the ATCC on August 4, 1998 and is assigned 
ATCC deposit no. 203095. It is understood that the deposited clone contains the actual sequence and that the 
representations herein may have minor sequencing errors. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
20 alignment analysis of the full-length sequence shown in Figure 284 (SEQ ID NO:399), revealed some sequence 
identity (and therefore some relation) between the PRO 1 187 amino acid sequence and the following Dayhoff 
sequences: MGNENDOBXJ, CELF41G3_9, AMPGSTRLI, HSBBOVHERL2, LEEXTEN101, 
AF029958J and P_W04957. 

25 EXAMPLE 128: Isolation of cDNA clones Encoding Human PRO 11 85 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

30 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56426. 

35 In light of an observed sequence homology between the DNA56426 consensus sequence and an EST 

sequence encompassed within the lncyte EST clone no. 32844 1 1 , the lncyte EST clone 32844 1 1 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a fuU-length protein. 
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The sequence of this cDNA insert is shown in Figure 285 and is herein designated as DNA62881-1515. 

The full length DNA62881-1515 clone shown in Figure 285 contained a single open reading frame with 
an apparent translational initiation site at nucleotide positions 4-6 and ending at the stop codon found at 
nucleotide positions 598-600 (Figure 285; SEQ ID NO:400). The predicted polypeptide precursor (Figure 286, 
SEQ ID NO.-401) is 198 amino acids long. The signal peptide is at about amino acids 1-21 of SEQ ID NO:401 . 
PROH85 has a calculated molecular weight of approximately 22,105 daltons and an estimated pi of 
approximately 7.73. Clone DNA62881-1515 has been deposited with the ATCC and is assigned ATCC deposit 
no. 203096. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 286 (SEQ ID NO:401), revealed some sequence 
identity between the PR01185 amino acid sequence and the following Dayhoff sequences: TUP1_YEAST, 
AF041382J, MAOM_SOLTU. SPPBPHU9_ 1,14 1024, EPCPLCFAILJ , HSPLEC_1, YKL4CAEEL, 
A44643, TGU65922J. 

EXAMPLE 129: Isolation of cDNA clones Encoding Human PR0 1 345 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA47364. Based on the DNA47364 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01345. 

20 PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5'-CCTGGTTATCCCCAGGAACTCCGAC-3' (SEQ ID NO:404) 
reverse PCR primer 5 -CTCTTGCTGCTGCGAC AGGCCTC-3~ (SEQ ID NO:405) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47364 
sequence which had the following nucleotide sequence 
) hybridization probe 

5 -CGCCCTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGAC-3 1 (SEQ ID NO:406) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR01345 gene using the probe oligonucleotide and one of the PCR primers. RNA 
) for construction of the cDNA libraries was isolated from human breast carcinoma tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01345 (designated herein as DNA64852-1589 [Figure 287, SEQ ID NO:402]) and the derived protein 
sequence for PR01345. 

The entire nucleotide sequence of DNA64852-1589 is shown in Figure 287 (SEQ ID NO:402). Clone 
DNA64852-1589 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 7-9 or 34-36 and ending at the stop codon at nucleotide positions 625-627 (Figure 287). The predicted 
polypeptide precursor is 206 amino acids long (Figure 288). The full-length PR01345 protein shown in Figure 
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288 has an estimated molecular weight of about 23, 190 daltons and a pi of about 9.40. Analysis of the full- 
length PR01345 sequence shown in Figure 288 (SEQ ID NO:403) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 31 or from about amino acid 10 to about amino acid 
31 and a C-rype lectin domain signature sequence from about amino acid 176 to about amino acid 190. Clone 
DNA64852-1589 has been deposited with ATCC on August 18, 1998 and is assigned ATCC deposit no. 203127. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 288 (SEQ ID NO:403), evidenced significant 
homology between the PR01345 amino acid sequence and the following Dayhoff sequences: BTU22298J, 
TETN.CARSP, TETNHUMAN, MABA.RAT, S34198, P_W13144. M ACMBPAl , A46274, PSPD RAT 
AND P_R32188. 

EXAMPLE 130: Isolation of c DNA clones Encoding Human PRO! 245 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto. CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
E JM molog i: 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green. University of Washington, Seattle. Washington). The consensus sequence 
obtained therefrom is herein designated DNA56019. 

In light of an observed sequence homology between the DNA56019 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1327836, the Incyte EST clone 1327836 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 289 and is herein designated as DNA64884-1527. 

The full length clone shown in Figure 289 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 79-81 and ending at the stop codon found at nucleotide positions 
391-393 (Figure 289; SEQ ID NO:407). The predicted polypeptide precursor (Figure 290, SEQ ID NO:408) 
is 104 amino acids long, with a signal peptide sequence at about amino acid 1 to about amino acid 18. 
PR01245 has a calculated molecular weight of approximately 10,100 daltons and an estimated pi of 
approximately 8.76. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 290 (SEQ ID NO:408), revealed some homology 
between the PR01245 amino acid sequence and the following Dayhoff sequences: SYA_THETH, GEN1 1 167, 
MTV044_4, AB011151J. RLAJ2750_3, SNELIPTRA l , S63624, C28391, A37907."and S14064. 

Clone DNA64884-1245 was deposited with the ATCC on August 25. 1998 and is assigned ATCC 
deposit no. 203155. 
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EXAMPLE 131: Isolation of c DNA clones Encoding Human PRO 1358 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g . , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzymology 266:4«W80 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
10 encompassed within the Incyte EST clone no. 88718, the Incyte EST clone 88718 was purchased and the cDNA 
insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The sequence 
of this cDNA insert is shown in Figure 291 and is herein designated as DNA64890-1612. 

The full length clone shown in Figure 291 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86 through 88 and ending at the stop codon found at nucleotide 
positions 1418 through 1420 (Figure 291 ; SEQ ID NO:409). The predicted polypeptide precursor (Figure 292. 
SEQ ID NO:410) is 444 amino acids long. The signal peptide is at about amino acids 1-18 of SEQ ID NO:410. 
PR01358 has a calculated molecular weight of approximately 50,719 daltons and an estimated pi of 
approximately 8.82. Clone DNA64890-1612 was deposited with the ATCC on August 18, 1998 and is assigned 
ATCC deposit no. 203131. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 292 (SEQ ID NO:4 10), revealed sequence identity 
between the PROI358 amino acid sequence and the following Dayhoff sequences: P_W07607, AB000545 1, 
AB000546_1, A1ATRAT, AB0151641.PP50021, COTRCAVPO, and HAMHPP~ 1. The variants claimed 
in this application exclude these sequences. 

EXAMPLE 132: Isolation of c DNA clones Encoding Human PRO 1 195 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymology 266:460-180 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA55716. 

In light of an observed sequence homology between the DNA55716 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3252980, the Incyte EST clone 3252980 was purchased 
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and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 293 and is herein designated as DNA65412-1523. 

The full length clone shown in Figure 293 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 58-60 and ending at the stop codon found at nucleotide positions 
511-513 (Figure 293; SEQ ID NO:411). The predicted polypeptide precursor (Figure 294, SEQ ID NO:412) 
is 151 amino acids long. The signal sequence is at about amino acids 1-22 of SEQ ID NO:412. PROl 195 has 
a calculated molecular weight of approximately 17,277 daltons and an estimated pi of approximately 5.33. Clone 
DNA65412-1523 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203094. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 294 (SEQ ID NO:412), revealed some sequence 
identity between the PROl 195 amino acid sequence and the following Dayhoff sequences: MMU28486_1, 
AF044205J, PW31186, CELK03C7J, F69034, EF1AJ4ETVA, AF024540J, SSU90353J, 
MRSP_STAAU and P_R97680. 

EXAMPLE 133 : Isolation of cDNA clones Encoding Human PRO 1270 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57951. 

In light of an observed sequence homology between the DNA57951 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 124878, the Merck EST clone 124878 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 295 and is herein designated as DNA66308-1537. 

Clone DNA66308-1537 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 103-105 and ending at the stop codon at nucleotide positions 1042-1044 (Figure 295). 
The predicted polypeptide precursor is 313 amino acids long (Figure 296). The full-length PRO1270 protein 
shown in Figure 296 has an estimated molecular weight of about 34,978 daltons and a pi of about 5.71 . Analysis 
of the full-length PRO1270 sequence shown in Figure 296 (SEQ ID NO:414) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 16, a potential N-glycosylation site from 
about amino acid 163 to about amino acid 166 and glycosaminoglycan attachment sites from about amino acid 
74 to about amino acid 77 and from about amino acid 289 to about amino acid 292. Clone DNA66308-1537 has 
been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203159. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 296 (SEQ ID NO:414), evidenced significant 
homology between the PRO1270 amino acid sequence and the following Dayhoff sequences: XLU86699 1, 
S49589,HBA_PARPA,FIBB_HUMAN,P_R47^ 
and FIBGBOVIN. 

5 

EXAMPLE 134: Isolation of cDNA clones Encoding Human PRO 1271 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
10 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
15 obtained therefrom is herein designated DNA57955. 

In light of an observed sequence homology between the DNA57955 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA625350, the Merck EST clone AA625350 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 297 and is herein designated as DNA66309-1538. 
20 Clone DNA66309-1538 contains a single open reading frame with an apparent translation^ initiation 

site at nucleotide positions 94-96 and ending at the stop codon at nucleotide positions 7 18-720 (Figure 297). The 
predicted polypeptide precursor is 208 amino acids long (Figure 298). The full-length PR01271 protein shown 
in Figure 298 has an estimated molecular weight of about 21,531 daltons and a pi of about 8.99. Analysis of 
the full-length PR01271 sequence shown in Figure 298 (SEQ ID NO:416) evidences the presence of the 
25 following: a signal peptide from about amino acid 1 to about amino acid 3 1 and a transmembrane domain from 
about amino acid 166 to about amino acid 187. Clone DNA66309-1538 has been deposited with ATCC on 
September 15, 1998 and is assigned ATCC deposit no. 203235. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 298 (SEQ ID NO:416), evidenced significant 
30 homology between the PR01271 amino acid sequence and the following Dayhoff sequences: S57180, S63257, 
AG A 1_ YEAST, BPU43599J, YS8ACAEEL, S67570, LSU54556_2, S7O305, VGLX_HSVEB, and 
. D88733J. 

EXAMPLE 135: Isolation of cDNA clones Encoding Human PRQ137S 
35 A Merck/Wash. U. database was searched and a Merck EST was identified. This sequence was then 

put in a program which aligns it with other seequences from the Swiss-Prot public database, public EST 
databases (e.g., GenBank, Merck/Wash. U.). and a proprietary EST database (LIFESEQ*. Incyte 
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Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
[Altschul et al., Methods in Enzvmology. 266:460-480 (1996)) as a comparison of the extracellular domain 
(BCD) protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a 
BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and 
assembled into consensus DNA sequences with the program u phrap w (Phil Green, University of Washington, 
Seattle, Washington). 

A consensus DNA sequence was assembled relative to other EST sequences using phrap. This 
consensus sequence is designated herein a DNA67003 w . 

Based on theDNA67003 consensus sequence, the nucleic acid (SEQ ID NO:417) was identified in a 
human pancreas library. DNA sequencing of the clone gave the full-length DNA sequence for PR01375 and 
the derived protein sequence for PRO 1375. 

The entire coding sequence of PR01375 is shown in Figure 299 (SEQ ID NO:4 1 7). Clone DNA67004- 
1614 contains a single open reading frame with an apparent translation^ initiation site at nucleotide positions 
104-106 and an apparent stop codon at nucleotide positions 698-700 of SEQ ID NO:417. The predicted 
polypeptide precursor is 198 amino acids long. The transmembrane domains are at about amino acids 1 1-28 
(type II) and 103-125 of SEQ ID NO:418. Clone DNA67004-1614 has been deposited with ATCC and is 
assigned ATCC deposit no. 203115. The full-length PR01375 protein shown in Figure 300 has an estimated 
molecular weight of about 22,531 daltons and a pi of about 8.47. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 300 (SEQ ID NO:4 1 8), revealed sequence identity 
between the PR01375 amino acid sequence and the following Dayhoffsequences:AF026198_5, CELR12C12 5, 
S73465, Y011.MYCPN, S64538J, P_P8150, MUVSHPO10J, VSH_MUMPL and CVU59751_5. 

EXAMPLE 136: Isolation of cDNA clones Encoding Human PRO 1385 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Aitshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57952. 

In light of an observed sequence homology between the DNA57952 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3129630, the Incyte EST clone 3129630 was purchased 
and the cDNA insert was btained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 301 and is herein designated as DNA68869-1610. 
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Clone DNA68869-1610 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 26-28 and ending at the stop codon at nucleotide positions 410-412 (Figure 301). The 
predicted polypeptide precursor is 128 amino acids long (Figure 302). The full-length PR01385 protein shown 
in Figure 302 has an estimated molecular weight of about 13,663 daltons and a pi of about 10.97. Analysis of 
the full-length PR01385 sequence shown in Figure 302 (SEQ ID NO:420) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 28, and glycosylaminoglycan attachment 
sites from about amino acid 82 to about amino acid 85 and from about amino acid 91 to about amino acid 94. 
Clone DNA68869-1610 has been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 
203164. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 302 (SEQ ID NO:420), evidenced low homology 
between the PR01385 amino acid sequence and the following Dayhoff sequences: CELT14A8_1, 
LMNACHRA1J, HXD9_HUMAN, CHKCMLF 1, HS5PP34 2, DMDRING1, A37107J, 
MMLUNGENEJ , PUM_DROME and DMU251 17 J . 

15 EXAMPLE 137 : Isolation of cDNA clones Encoding Human PRO 1387 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

20 homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56259. 

25 In light of an observed sequence homology between the DNA56259 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3507924, the Incyte EST clone 3507924 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 303 and is herein designated as DNA68872-1620. 

Clone DNA68872-1620 contains a single open reading frame with an apparent translational initiation 

30 site at nucleotide positions 85-87 and ending at the stop codon at nucleotide positions 1267-1269 (Figure 303). 
The predicted polypeptide precursor is 394 amino acids long (Figure 304). The full-length PRO 1387 protein 
shown in Figure 304 has an estimated molecular weight of about 44,339 daltons and a pi of about 7.10. Analysis 
of the full-length PR01387 sequence shown in Figure 304 (SEQ ID NO:422) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19, a transmembrane domain from about 

35 amino acid 275 to about amino acid 296, potential N-glycosylation sites from about amino acid 76 to about amino 
acid 79, from about amino acid 231 to aboui amino acid 234, from about amino acid 302 to about amino acid 
305, from about amino acid 307 to about amino acid 310 and from about amino acid 376 to about amino acid 
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379, and amino acid sequence blocks having homology ro myelin pO protein from about amino acid 210 to about 
amino acid 239 and from about amino acid 92 to about amino acid 121. Clone DNA68872-1620 has been 
deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203160. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 304 (SEQ ID NO:422), evidenced significant 
5 homology between the PR01387 amino acid sequence and the following Dayhoff sequences: P_W36955, 
MYP0HETFR, HS46KDAJ, AF049498 1, MYO0 HUMAN, AF030454J, A53268, SHPTCRAJ, 
P_W14146andGEN12838. 

EXAMPLE 138 : Isolation of cDNA clones Encoding Human PRQ1384 
10 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA54192. Based on the DNA54I92 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01384. 

PCR primers (forward and reverse) were synthesized: 
15 forward PCR primer 5*-TGCAGCCCCTGTGACACAAACTGG-3 * (SEQ ID NO:425) 
reverse PCR primer 5 , -CTGAGATAACCGAGCCATCCTCCCAC-T (SEQ ID NO:426) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA54192 
sequence which had the following nucleotide sequence: 
hybridization probe 

20 5 ' -GG AG AT AGCTGCTATGGGTTCTTC AGGC AC AACTT A AC ATGGG AAG-3 * (SEQ ID NO:427) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PRO 1384 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human fetal liver. 
25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01384 (designated herein as DNA71 159-1617 [Figure 305, SEQ ID NO:423]; and the derived protein 

sequence for PR01384. 

The entire coding sequence of PR01384 is shown in Figure 305 (SEQ ID NO:423). Clone DNA71 159- 
1617 contains a single open reading frame with an apparent translation^ initiation site at nucleotide positions 

30 1 82- 1 84 and an apparent stop codon at nucleotide positions 869-87 1 . The predicted polypeptide precursor is 229 
amino acids long. The full-length PRO 1384 protein shown in Figure 306 has an estimated molecular weight of 
about 26,650 daltons and a pi of about 8.76. Additional features include a type II transmembrane domain at 
about amino acids 32-57, and potential N-glycosylation sites at about amino acids 68-71, 120-123, and 134-137. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 306 (SEQ ID NO:424), revealed homology 
between the PRO 1384 amino acid sequence and the following Dayhoff sequences: AF054819J , HSAJ 1687_1 , 
AF0095 1 1_1 , AB010710J , GEN 13595, HSAJ673J , GEN 13961 , AB005900J , LECH _CHICK,AF021349_1, 
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and NK13_RAT. 

Clone DNA71 159-1617 has been deposited with ATCC and is assigned ATCC deposit no. 203135. 

EXAMPLE 139 : Use of PRO as a hybridization probe 

The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe. 
5 DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as 

a probe to screen for homologous DN As (such as those encoding naturally-occurring variants of PRO) in human 
tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following 
high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a 
10 solution of 50% formamide, 5x SSC, 0. 1 % SDS, 0. 1 % sodium pyrophosphate, 50 mM sodium phosphate, pH 
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed 
in an aqueous solution of 0. lx SSC and 0. 1 % SDS at 42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO can 
then be identified using standard techniques known in the art. 

15 

EXAMPLE 140: Expression of PRO in E. coli 

This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in 

E. coli. 

The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers 
20 should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is 
pBR322 (derived from E. coli; see Bolivar et al., Gene, 2j95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode 
25 for an antibiotic resistance gene, a up promoter, a polyhis leader (including the first six STII codons, polyhis 
sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an 
argU gene. 

The ligation mixture is then used to transform a selected £. coli strain using the methods described in 
Sambrook et al. , supra . Transformants are identified by their ability to grow on LB plates and antibiotic resistant 
30 colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are 
then grown to a desired optical density, during which the expression promoter is turned on. 
- 35 After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell 

pellet obtained by the centrifugatiori can be solubilized using various agents known in the an, and the solubilized 
PRO protein can then be purified using a metal chelating column under conditions that allow tight binding of the 
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protein. 

PRO may be expressed in £. coli in a poly-His tagged form, using the following procedure. The DNA 
encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction enzyme 
sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful 
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation 
5 column, and proteolytic removal with enterokinase. The PCR-arnplified, poly-His tagged sequences are then 
ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110 
fuhA(tonA) Ion galE rpoHts(htpRts) clpP(ladq). Transformants are first grown in LB containing 50 mg/ml 
carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into 
CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium citrate*2H20, 1.07 g KC1, 5.36 g Difco 
10 yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 1 10 mM MPOS, pH 7.3, 0.55% (w/v) 
glucose and 7 mM MgSOJ and grown for approximately 20-30 hours at 30°C with shaking. Samples are 
removed to verify expression by SDS-PAGE analysis, and the bulk culture is centriruged to pellet the cells. Cell 
pellets are frozen until purification and refolding. 

£. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 
15 guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0. 1M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results 
in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centriruged at 40,000 
rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate 
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The 
20 clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol 
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the 
desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm 
using the calculated extinction coefficient based on its amino acid sequence. 
25 The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting 

of: 20 mM Tris, pH 8.6, 0.3 M NaCi, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. 
Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The 
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition 
ofTFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the 
30 solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The 
refolded protein is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0.1 % 
TFA with elution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance 
are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. 
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile 
35 since those species are the most compact with their hydrophobic interiors shielded from interaction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin 
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from the samples. 

Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using 
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 
0. 14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile filtered. 
5 Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 141 : Expression of PRO in m ammalian cells 

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant 
expression in mammalian cells. 
10 The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 

Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO 
DNA using ligation methods such as described in Sambrook et al., supra . The resulting vector is called pRK5- 
PRO. 

In one embodiment, the selected host cells may be 293 cells. Human 293 ceils (ATCC CCL 1573) are 

15 grown to confluence in tissue culture plates in medium such as DM EM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 fig pRK5-PRO DNA is mixed with about 1 
DNA encoding the VA RNA gene [Thimmappaya et al., Cell, 31:543 (1982)) and dissolved in 500 y\ of 1 mM 
Tris-HCl, 0.1 mM EDTA, 0.227 M CaCU. To this mixture is added, dropwise, 500 pi of 50 mM HEPES (pH 
7.35), 280 mM NaCl, 1.5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 

20 precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The 
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture 
medium (alone) or culture medium containing 200 /iCi/ml 35 S-cysteine and 200 jiCi/ml 35 S-methionine. After 

25 a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15 % 
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the 
presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in 
serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran sulfate 

30 method described by Somparyrac et al., Proc. Natl. Acad. Sci. . 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 jig pRK5-PRO DNA is added . The cells are first concentrated from 
the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell 
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture 
medium, and re-introduced into the spinner flask containing tissue culture medium, 5 ixgfml bovine insulin and 
35 0. 1 jig/ml bovine transferrin. After about four days, the conditioned media is centrifuged and filtered to remove 
cells and debris. The sample containing expressed PRO can then be concentrated and purified by any selected 
method, such as dialysis and/or column chromatography. 
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In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected into 
CHO cells using known reagents such as CaP0 4 or DEAE-dextran. As described above, the cell cultures can 
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as 
M S-methionine. After deteirmining the presence of PRO polypeptide, the culture medium may be replaced with 
serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium 
5 is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selected 
method. 

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the 
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly- 
his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can men be subcloned into a SV40 

10 driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells 
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described 
above, to verify expression. The culture medium containing the expressed poly-His tagged PRO can then be 
concentrated and purified by any selected method, such as by Ni 2+ -chelate affinity chromatography. 

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO 

15 cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular 
domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 and 
CH2 domains and/or is a poly-His tagged form. 

20 Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 

standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology, Unit 3.16, John 
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5* and 3' 
of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used expression in CHO cells 
is as described in Lucas et al., Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early 

25 promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
expression permits selection for stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells 
using commercially available transfection reagents Superfect" (Quiagen), Dosper* or Fugene* (Boehringer 
Mannheim). The cells are grown as described in Lucas et al., supra . Approximately 3 x 10' 7 cells are frozen 

30 in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
vonexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
fim filtered PS20 with 5% 0.2 /^m diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL 

35 spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 
filled with 150 mL selective growth medium and incubated at 37?C. After another 2-3 days, 250 mL, 500 mL 
and 2000 mL spinners are seeded with 3 x 10 5 cells/mL. The cell media is exchanged with fresh media by 
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centrifugation and resuspension in production medium. Although any suitable CHO media may be employed, 
a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 
3L production spinner is seeded at 1.2 x 10 6 cells/mL. On day 0, the cell number pH ie determined. On day 
1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the 
temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% 
5 polydunethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) taken. Throughout the production, 
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below 
70 % , the cell culture is harvested by centrifugation and filtering through a 0.22 /^m filter. The filtrate was either 
stored at 4°C or immediately loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 
10 purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media 
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl 
and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional 
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly 
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCl and 4% 
15 mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 
mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer 
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 
20 ml fractions into rubes containing 275 of 1 M Tris buffer, pH 9. The highly purified protein is subsequently 
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed 
by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

25 EXAMPLE 142 : Expression of PRO in Yeast 

The following method describes recombinant expression of PRO in yeast. 

• First, yeast expression vectors are constructed for intracellular production or secretion of PRO from 
the ADH2/GAPDH promoter. DN A encoding PRO and the promoter is inserted into suitable restriction enzyme 
sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can 

30 be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO 
signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory 
signal/leader sequence, and linker sequences (if needed) for expression of PRO. 

Yeast cells, such as yeast strain AB1 10, can then be transformed with the expression plasmids described 
above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by 

35 precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
Coomassie Blue stain. 
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Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing PRO may further be purified using selected column chromatography resins. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 143: Expression of PRO in Baculovirus-lnfected Insect Cells 

The following method describes recombinant expression of PRO in Baculovirus-infected insect cells. 
The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG). 
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such 
as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence of 
PRO such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence 
encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to 
the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product 
is then digested with those selected restriction enzymes and subcloned into the expression vector. 

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus 
DNA (Pharmingen) into Spodopterafrugiperda CSf9") cells (ATCC CRL 171 1) using lipofectin (commercially 
available from GIBCO-BRL). After 4 - 5 days of incubation at 28°C, the released viruses are harvested and used 
for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et 
al., Baculovirus expression vectors: A Laboratory Manual, Oxford: Oxford University Press (1994). 

Expressed poly-his tagged PRO can then be purified, for example, by Ni 2+ -chelate affinity 
chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by 
Rupert et al., Nature, 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mL Hepes, pH 7.9; 12.5 mM MgCl 2 ; 0.1 inM EDTA; 10% glycerol; 0. 1 % NP-40; 0.4 M KC1), and sonicated 
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold 
in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 ptm 
filter. A Ni 2+ -NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5 
mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is 
loaded onto the column at 0.5 mL per minute. The column is washed to baseline A^ with loading buffer, at 
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM 
phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching 
A 280 baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with 
Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His I0 -tagged PRO are 
pooled and dialyzed against loading buffer. 

Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 
chromatography techniques, including for instance, Protein A or protein G column chromatography. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 
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EXAMPLE 144: Preparation of Antibodies that Bind PRO 

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO. 

Techniques for producing the monoclonal antibodies are known in the art and are described, for 
instance, in Goding, supra. Immunogens that may be employed include purified PRO, fusion proteins containing 
PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by 
the skilled artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, 
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and 
injected into die animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with 
additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be 
boosted with additional immunization injections. Serum samples may be periodically obtained from the mice 
by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected 
with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells 
are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma 
cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells 
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and 
thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of 
"positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce 
ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue 
culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be 
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 
affinity chromatography based upon binding of antibody to protein A or protein G can be employed. 

EXAMPLE 145: Purification of PRO Polypeptides Using Specific Antibodies 

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the art 
of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide 
is purified by immunoaffmity chromatography using antibodies specific for the PRO polypeptide of interest. In 
general, an immunoaffmity column is constructed by covalently coupling the anti-PRO polypeptide antibody to 
an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium 
sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, NJ.). 
Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 
chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 
chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody 
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is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
instructions. 

Such an imrnunoaffinity column is utilized in the purification of PRO polypeptide by preparing a fraction 
from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differential cemrifugation by the addition of detergent or by 
5 other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may 
be secreted in useful quantity into the medium in which the cells are grown. 

A soluble PRO polypeptide^ntaining preparation is passed over the immunoaffmity column, and the 
column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g. , high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
10 antibody /PRO polypeptide binding (e.g. , a low pH buffer such as approximately pH 2-3, or a high concentration 
of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

EXAMPLE 146: Drug Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or binding 

15 fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed 
in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened 
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be 

20 used for standard binding assays. One may measure, for example, the formation of complexes between PRO 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can 
affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent with 

25 an PRO polypeptide or fragment thereof and assaying (I) for the presence of a complex between the agent and 
the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment 
and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or 
fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that 
present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular 

30 agent to bind to PRO polypeptide or to interfere with the PRO polypeptide/cell complex. 

Another technique for drug screening provides high throughput screening for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. 
Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such 
as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted 

35 with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the art. 
Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the 
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solid support. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with PRO polypeptide. 

5 

EXAMPLE 147 : Rational Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 
interest (i.e. , a PRO polypeptide) or of small molecules with which they interact, e.g. , agonists, antagonists, or 
inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the 

10 PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c.f. , Hodgson, 
Bio/Technology. 9: 19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of an PRO 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, 
by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascertained 

15 to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding 
the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins. 
In both cases, relevant structural information is used to design analogous PRO polypeptide-like molecules or to 
identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved - 
activity or stability as shown by Braxton and Wells, Biochemistry. 31*7796-7801 (1992) or which act as 

20 inhibitors, agonists, or antagonists of native peptides as shown by Athauda el aL y J. Biochem.. 113:742-746 
(1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, 
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent 
drug design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic 

25 antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, 
the binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then 
be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated 
peptides would then act as the pharmacore. 

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available 

30 to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide 
amino acid sequence provided herein will provide guidance to those employing computer modeling techniques 
in place of or in addition to x-ray crystallography. 
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Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 10801 
University Blvd., Manassas, VA 20110-2209, USA (ATCC): . 
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These deposit were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 

35 between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of 
the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the 
public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny 
to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 
USC §122 and the Commissioner's rules pursuant thereto (including 37 CFR §1.14 with particular reference to 

40 886 OG 638). 

The assignee of the present application has agreed mat if a culture of the materials on deposit should 
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 
notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
45 accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
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deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the an from the foregoing 
description and fall within the scope of the appended claims. 
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WHAT IS CLAIMED IS : 

1. Isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that 
encodes a polypeptide comprising an amino acid sequence selected from the group consisting f the amino acid 
sequence shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 
(SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 
5 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), 
Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID 
NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ 
ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 
(SEQ ID NO: 1 13), Figure 66 (SEQ ID NO: 1 15), Figure 68 (SEQ ID NO: 117), Figure 70 (SEQ ID NO: 1 19), 

10 Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129), Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID 
NO: 138), Figure 83 (SEQ ID NO: 146), Figure 85 (SEQ ID NO: 148), Figure 88 (SEQ ID NO: 151), Figure 90 
(SEQ ID NO: 153), Figure 93 (SEQ ID NO: 156), Figure 95 (SEQ ID NO: 158), Figure 97 (SEQ ID NO: 160), 
Figure 99 (SEQ ID NO: 165), Figure 301 (SEQ ID NO: 167), Figure 103 (SEQ ID NO: 169), Figure 105 (SEQ 
ID NO: 171), Figure 109 (SEQ ID NO: 175), Figure 111 (SEQ ID NO: 177), Figure 113 (SEQ ID NO: 179), 

15 Figure 1 15 (SEQ ID NO: 181), Figure 1 17 (SEQ ID NO: 183), Figure 120 (SEQ ID NO: 189), Figure 122 (SEQ 
ID NO: 194), Figure 125 (SEQ ID NO: 197), Figure 127 (SEQ ID NO: 199), Figure 129 (SEQ ID NO:201), 
Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ 
ID NO:209), Figure 139 (SEQ ID NO:211), Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), 
Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure 153 (SEQ 

20 ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID NO:23I), 
Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), Figure 167 (SEQ 
ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), 
Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ 
ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), 

25 Figure 190 (SEQ ID NO:272). Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ 
ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), 
Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ 
ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), 
Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303) t Figure 226 (SEQ ID NO:309), Figure 228 (SEQ 

30 ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), 
Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ 
ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), 
Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ 
ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), 

35 Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ 
ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381), Figure 278 (SEQ ID NO:387), 
Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), Figure 286 (SEQ 
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ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410), 
Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ 
ID NO:418) t Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422) and Figure 306 (SEQ ID NO:424). 

2 . The nucleic acid sequence of Claim 1 , wherein said nucleotide sequence comprises a nucleotide 
sequence selected from the group consisting of the sequence shown in Figure 1 (SEQ ID NO: 1), Figure 3 (SEQ 
ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID NO: 19), Figure 14 (SEQ 
ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ ID NO:32), Figure 24 
(SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 (SEQ ID NO:51), Figure 
33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 40 (SEQ ID NO:72), 
Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), Figure 52 (SEQ ID 
NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 1 12), Figure 65 (SEQ ID NO: 1 14), Figure 67 
(SEQ ID NO: 116), Figure 69 (SEQ ID NO: 118), Figure 71 (SEQ ID NO: 123), Figure 73 (SEQ ID NO: 128), 
Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137). Figure 82 (SEQ ID NO: 145), Figure 84 (SEQ ID 
NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID NO: 155), Figure 94 
(SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 (SEQ ID NO: 166), 
Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID NO: 174), Figure 1 10 (SEQ 
ID NO:176), Figure 112 (SEQ ID NO:178), Figure 114 (SEQ ID NO:180), Figure 116 (SEQ ID NO:182), 
- Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID NO:196), Figure 126 (SEQ 
ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 132 (SEQ ID NO:204), 
Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210), Figure 140 (SEQ 
ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220), 
Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ 
ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), 
Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID NO:249), Figure 170 (SEQ 
ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 177 (SEQ ID NO:259), 
Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID NO:265), Figure 185 (SEQ 
ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 191 (SEQ ID NO:273), 
Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280), Figure 199 (SEQ 
ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288), 
Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 21 1 (SEQ ID NO:294), Figure 213 (SEQ 
ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), 
Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID NO:318), Figure 232 (SEQ 
ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 239 (SEQ ID NO:344), 
Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID NO:350), Figure 247 (SEQ 
ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 253 (SEQ ID NO:358), 
Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364), Figure 261 (SEQ 
ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372), 
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Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ 
ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), 
Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ 
ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:411), Figure 295 (SEQ ID NO:413), 
Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID NO:419), Figure 303 (SEQ 
ID NO:421) and Figure 305 (SEQ ID NO:423). 

3 . The nucleic acid of Claim 1 , wherein said nucleotide sequence comprises a nucleotide sequence 
selected from the group consisting of the full-length coding sequence of die sequence shown in Figure 1 (SEQ 
ID NO: 1), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 1 1 (SEQ ID 
NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 
NO: 1 14), Figure 67 (SEQ ID NO: 1 16), Figure 69 (SEQ ID NO: 1 18), Figure 7 1 (SEQ ID NO: 123), Figure 73 
(SEQ ID NO:128), Figure 75, (SEQ ID. NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO:145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID r?JO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 
NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO^OO), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206),- Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 
.148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO: 245),. Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID.NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
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NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380) t Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 
281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:411), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 
NO:419), Figure 303 (SEQ ID NO:421) or Figure 305 (SEQ ID NO:423). 

. 4. Isolated nucleic acid which comprises the full-length coding sequence of the DNA deposited 
under any ATCC accession number shown in Table 2. 

5. A vector comprising the nucleic acid of Claim 1. 

6. The vector of Claim 5 operably linked to control sequences recognized by a host cell 
transformed with the vector. 

7. A host cell comprising the vector of Claim 5. 

8. The host cell of Claim 7 wherein said cell is a CHO cell. 

9. The host cell of Claim 7 wherein said cell is an E. coli. 

10. The host cell of Claim 7 wherein said cell is a yeast cell. 

11. A process for producing a PRO polypeptides comprising culturing the host cell of Claim 7 
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from the 
cell culture. 

12. Isolated PRO polypeptide having at least 80% sequence identity to an amino acid sequence 
selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ 
ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ 
ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 
(SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 
34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), 
Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID 
NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 (SEQ ID NO: 1 13), Figure 66 (SEQ ID NO: 1 15), Figure 68 



504 



WO 99/63088 



PCT/US99/12252 



(SEQ ID NO: 117), Figure 70 (SEQ ID NO: 119), Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129), 
Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID NO: 138), Figure 83 (SEQ ID NO: 146), Figure 85 (SEQ ID 
NO: 148), Figure 88 (SEQ ID NO: 15 1), Figure 90 (SEQ ID NO: 153), Figure 93 (SEQ ID NO: 156), Figure 95 
(SEQ ID NO: 158), Figure 97 (SEQ ID NO: 160), Figure 99 (SEQ ID NO: 165), Figure 101 (SEQ ID NO: 167), 
Figure 103 (SEQ ID NO: 169), Figure 105 (SEQ ID NO: 171), Figure 109 (SEQ ID NO: 175), Figure 111 (SEQ 
5 ID NO:177), Figure 113 (SEQ ID NO: 179), Figure 115 (SEQ ID NO:181), Figure 117 (SEQ ID NO: 183), 
Figure 120 (SEQ ID NO:189), Figure 122 (SEQ ID NO:194), Figure 125 (SEQ ID NO:197), Figure 127 (SEQ 
ID NO: 199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), 
Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:21 1), Figure 141 (SEQ 
ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), 

10 Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ 
ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), 
Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ 
ID NO:253), Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), 
Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ 

15 ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), 
Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ 
ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), 
Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ 
ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), 

20 Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ 
ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), 
Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ 
ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), 
Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ 

25 ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), 
Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ 
ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), 
Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ 
ID NO:408), Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), 

30 Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ 
ID NO:422) and Figure 306 (SEQ ID NO:424). 

13. Isolated PRO polypeptide having at least 80% sequence identity to the amino acid sequence 
encoded by a nucleic acid molecule deposited under any ATCC accession number shown in Table 2. 

35 

14. A chimeric molecule comprising a polypeptide according to Claim 12 fused to a heterologous 
amino acid sequence. 
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15. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is an 
epitope tag sequence. 

16. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is a Fc 
region of an immunoglobulin. 

5 

17. An antibody which specifically binds to a PRO polypeptide according to Claim 12. 

18. The antibody of Claim 17 wherein said antibody is a monoclonal antibody. 
10 19. The antibody of Claim 17 wherein said antibody is a humanized antibody. 

20. The antibody of Claim 17 wherein said antibody is an antibody fragment. 

21. An isolated nucleic acid molecule which has at least 80% sequence identity to a nucleic acid 
15 which comprises a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 

NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 
NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 

20 40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 
NO: 1 14), Figure 67 (SEQ ID NO: 1 16), Figure 69 (SEQ ID NO: 1 18), Figure 71 (SEQ ID NO: 123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 

25 NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 
NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 112 (SEQ ID NO: 178), Figure 114 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 

30 132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 
148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 

35 NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
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191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
5 NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 

10 267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 
281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:411), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 

15 NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

22. An isolated nucleic acid molecule which has at least 80% sequence identity to the full-length 
coding sequence of a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 
NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 

20 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

25 NO: 1 14), Figure 67 (SEQ ID NO: 1 16), Figure 69 (SEQ ID NO: 1 18), Figure 71 (SEQ ID NO: 123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO:164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 

30 NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

35 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
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NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259) t Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
5 205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 

10 NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 

15 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 
NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

20 23. An isolated extracellular domain of of PRO polypeptide. 

24. An isolated PRO polypeptide lacking its associated signal peptide. 

25. An isolated polypeptide having at least about 80% amino acid sequence identity to an 
25 extracellular domain of of PRO polypeptide. 

26. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO 
polypeptide lacking its associated signal peptide. 
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FIGURE 1 

CGGACGCGTGGGTGCGAGGCGAAGGTGACCGGGGACCGAGCATTTCAGATCTGCTCGGTAGA 
CCTGGTGCACCACCACCATGTTGGCTGCAAGGCTGGTGTGTCTCCGGACACTACCTTCTAGG 
GTTTTCCACCCAGCTTTCACCAAGGCCTCCCCTGTTGTGAAGAATTCCATCACGAAGAATCA 
ATGGCTGTTAACACCTAGCAGGGAATATGCCACCAAAACAAGAATTGGGATCCGGCGTGGGA 
GAACTGGCCAAGAACTCAAAGAGGCAGCATTGGAACCATCGATGGAAAAAATATTTAAAATT 
GATCAGATGGGAAGATGGTTTGTTGCTGGAGGGGCTGCTGTTGGTCTTGGAGCATTGTGCTA 
CTATGGCTTGGGACTGTCTAATGAGATTGGAGCTATTGAAAAGGCTGTAATTTGGCCTCAGT 
ATGTCAAGGATAGAATTCATTCCACCTATATGTACTTAGCAGGGAGTATTGGTTTAACAGCT 
TTGTCTGCCATAGCAATCAGCAGAACGCCTGTTCTCATGAACTTCATGATGAGAGGCTCTTG 
GGTGACAATTGGTGTGACCTTTGCAGCCATGGTTGGAGCTGGAATGCTGGTACGATCAATAC 
CATATGACCAGAGCCCAGGCCCAAAGCATCTTGCTTGGTTGCTACATTCTGGTGTGATGGGT 
GCAGTGGTGGCTCCTCTGACAATATTAGGGGGTCCTCTTCTCATCAGAGCTGCATGGTACAC 
AGCTGGCATTGTGGGAGGCCTCTCCACTGTGGCCATGTGTGCGCCCAGTGAAAAGTTTCTGA 
ACATGGGTGCACCCCTGGGAGTGGGCCTGGGTCTCGTCTTTGTGTCCTCATTGGGATCTATG 
TTTCTTCCACCTACCACCGTGGCTGGTGCCACTCTTTACTCAGTGGCAATGTACGGTGGATT 
AGTTCTTTTCAGCATGTTCCTTCTGTATGATACCCAGAAAGTAATCAAGCGTGCAGAAGTAT 
CACCAATGTATGGAGTTCAAAAATATGATCCCATTAACTCGATGCTGAGTATCTACATGGAT 
ACATTAAATATATTTATGCGAGTTGCAACTATGCTGGCAACTGGAGGCAACAGAAAGAAATG 
AAGTGACTCAGCTTCTGGCTTCTCTGCTACATCAAATATCTTGTTTAATGGGGCAGATATGC 
ATTAAATAGTTTGTACAAGCAGCTTTCGTTGAAGTTTAGAAGATAAGAAACATGTCATCATA 
TTTAAATGTTCCGGTAATGTGATGCCTCAGGTCTGCCTTTTTTTCTGGAGAATAAATGCAGT 
AATCCTCTCCCAAATAAGCACACACATTTTCAATTCTCATGTTTGAGTGATTTTAAAATGTT 
TTGGTGAATGTGAAAACTAAAGTTTGTGTCATGAGAATGTAAGTCTTTTTTCTACTTTAAAA 
TTTAGTAGGTTCACTGAGTAACTAAAATTTAGCAAACCTGTGTTTGCATATTTTTTTGGAGT 
GCAGAATATTGTAATTAATGTCATAAGTGATTTGGAGCTTTGGTAAAGGGACCAGAGAGAAG 
GAGTCACCTGCAGTCTTTTGTTTTTTTAAATACTTAGAACTTAGCACTTGTGTTATTGATTA 
GTGAGGAGCCAGTAAGAAACATCTGGGTATTTGGAAACAAGTGGTCATTGTTACATTCATTT 
GCTGAACTTAACAAAACTGTTCATCCTGAAACAGGCACAGGTGATGCATTCTCCTGCTGTTG 
CTTCTCAGTGCTCTCTTTCCAATATAGATGTGGTCATGTTTGACTTGTACAGAATGTTAATC 
ATACAGAGAATCCTTGATGGAATTATATATGTGTGTTTTACTTTTGAATGTTACAAAAGGAA 
ATAACTTTAAAACTATTCTCAAGAGAAAATATTCAAAGCATGAAATATGTTGCTTTTTCCAG 
AATACAAACAGTATACTCATG 
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MLAARLVCLRTLPSRVFHPAFT 

KEAALEPSMEKIFKIDQMGRWFVAGGAAVGLGALCY^ 

HSTYIWLAGSIGLTALSAIAISRTPVLM^ 

GPKHLAWLLHSGVMGAWAPLTILGGPLLIRAAWYT^ 

GVGLGLVFVSSLGSMFLPPTTVAGATLYSVAMYGGLVLFSMFLLYDTQKVIKRAEVSPMYGV 
QKYDP INSMLS I YMDTLNI FMRVATMLATGGNRKK 
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GAAGGCTGCCTCGCTGGTCCGAATTCGGTGGCGCCACGTCCGCCCGTCTCCGCCTTCTGCAT 

CGCGGCTTCGGCGGCTTCCACCTAGACACCTAACAGTCGCGGAGCCGGCCGCGTCGTGAGGG 

GGTCGGCACGGGGAGTCGGGCGGTCTTGTGCATCTTGGCTACCTGTGGGTCGAAGATGTCGG 

ACATCGGAGACTGGTTCAGGAGCATCCCGGCGATCACGCGCTATTGGTTCGCCGCCACCGTC 

GCCGTGCCCTTGGTCGGCAAACTCGGCCTCATCAGCCCGGCCTACCTCTTCCTCTGGCCCGA 

AGCCTTCCTTTATCGCTTTCAGATTTGGAGGCCAATCACTGCCACCTTTTATTTCCCTGTGG 

GTCCAGGAACTGGATTTCTTTATTTGGTCAATTTATATTTCTTATATCAGTATTCTACGCGA 

CTTGAAACAGGAGCTTTTGATGGGAGGCCAGCAGACTATTTATTCATGCTCCTCTTTAACTG 

GATTTGCATCGTGATTACTGGCTTAGCAATGGATATGCAGTTGCTGATGATTCCTCTGATCA 

TGTCAGTACTTTATGTCTGGGCCCAGCTGAACAGAGACATGATTGTATCATTTTGGTTTGGA 

ACACGATTTAAGGCCTGCTATTTACCCTGGGTTATCCTTGGATTCAACTATATCATCGGAGG 

CTCGGTAATCAATGAGCTTATTGGAAATCTGGTTGGACATCTTTATTTTTTCCTAATGTTCA 

GATACCCAATGGACTTGGGAGGAAGAAATTTTCTATCCACACCTCAGTTTTTGTACCGCTGG 

CTGCCCAGTAGGAGAGGAGGAGTATCAGGATTTGGTGTGCCCCCTGCTAGCATGAGGCGAGC 

TGCTGATCAGAATGGCGGAGGCGGGAGACACAACTGGGGCCAGGGCTTTCGACTTGGAGACC 

AGTGAAGGGGCGGCCTCGGGCAGCCGCTCCTCTCAAGCCACATTTCCTCCCAGTGCTGGGTG 

CACTTAACAACTGCGTTCTGGCTAACACTGTTGGACCTGACCCACACTGAATGTAGTCTTTC 

AGTACGAGACAAAGTTTCTTAAATCCCGAAGAAAAATATAAGTGTTCCACAAGTTTCACGAT 

TCTCATTCAAGTCCTTACTGCTGTGAAGAACAAATACCAACTGTGCAAATTGCAAAACTGAC 

TACATTTTTTGGTGTCTTCTCTTCTCCCCTTTCCGTCTGAATAATGGGTTTTAGCGGGTCCT 

AATCTGCTGGCATTGAGCTGGGGCTGGGTCACCAAACCCTTCCCAAAAGGACCTTATCTCTT 

TCTTGCACACATGCCTCTCTCCCACTTTTCCCAACCCCCACATTTGCAACTAGAAAAAGTTG 

CCCATAAAATTGCTCTGCCCTTGACAGGTTCTGTTATTTATTGACTTTTGCCAAGGCTGGTC 

ACAACAATCATATTCACGTTATTTTCCCCTTTTGGTGGCAGAACTGTTACCAATAGGGGGAG 

AAGACAGCCACGGATGAAGCGTTTCTCAGCTTTTGGAATTGCTTCGACTGACATCCGTTGTT 

AACCGTTTGCCACTCTTCAGATATTTTTTATAAAAAAAGTACCACTGAGTTCATGAGGGCCA 

CAGATTGGTTATTAATGAGATACGAGGGTTGGTGCTGGGTGTTTGTTTCCTGAGCTAAGTGA 

TCAAGACTGTAGTGGAGTTGCAGCTAACATGGGTTAGGTTTAAACCATGGGGGATGCACCCC 

TTTGCGTTTCATATGTAGCCCTACTGGCTTTGTGTAGCTGGAGTAGTTGGGTTGCTTTGTGT 

TAGGAGGATCCAGATCATGTTGGCTACAGGGAGATGCTCTCTTTGAGAGGTCCTGGGCATTG 

ATTCCCATTTCAATCTCATTCTGGATATGTGTTCATTGAGTAAAGGAGGAGAGACCCTCATA 

CGGTATTTAAATGTCACTTTTTTGCCTATCCCCCGTTTTTTGGTCATGTTTCAATTAATTGT 

GAGGAAGGCGCAGCTCCTCTCTGCACGTAGATCATTTTTTAAAGCTAATGTAAGCACATCTA 

AGGGAATAACATGATTTAAGGTTGAAATGGCTTTAGAATCATTTGGGTTTGAGGGTGTGTTA 

TTTTGAGTCATGAATGTACAAGCTCTGTGAATCAGACCAGCTTAAATACCCACACCTTTTTT 

TCGTAGGTGGGCTTTTCCTATCAGAGCTTGGCTCATAACCAAATAAAGTTTTTTGAAGGCCA 

TGGCTTTTCACACAGTTATTTTATTTTATGACGTTATCTGAAAGCAGACTGTTAGGAGCAGT 

ATTGAGTGGCTGTCACACTTTGAGGCAACTAAAAAGGCTTCAAACGTTTTGATCAGTTTCTT 

TTCAGGAAACATTGTGCTCTAACAGTATGACTATTCTTTCCCCCACTCTTAAACAGTGTGAT 

GTGTGTTATCCTAGGAAATGAGAGTTGGCAAACAACTTCTCATTTTGAATAGAGTTTGTGTG 

TACTTCTCCATATTTAATTTATATGATAAAATAGGTGGGGAGAGTCTGAACCTTAACTGTCA 

TGTTTTGTTGTTCATCTGTGGCCACAATAAAGTTTACTTGTAAAATTTTAGAGGCCATTACT 

CCAATTATGTTGCACGTACACTCATTGTACAGGCGTGGAGACTCATTGTATGTATAAGAATA 

TTTCTGACAGTGAGTGACCCGGAGTCTCTGGTGTACCCTCTTACCAGTCAGCTGCCTGCGAG 

CAGTCATTTTTTCCTAAAGGTTTACAAGTATTTAGAACTTTTCAGTTCAGGGCAAAATGTTC 

ATGAAGTTATTCCTCTTAAACATGGTTAGGAAGCTGATGACGTTATTGATTTTGTCTGGATT 

ATGTTTCTGGAATAATTTTACCAAAACAAGCTATTTGAGTTTTGACTTGACAAGGCAAAACA 

TGACAGTGGATTCTCTTTACAAATGGAAAAAAAAAATCCTTATTTTGTATAAAGGACTTCCC 

TTTTTGTAAACTAATCCTTTTTATTGGTAAAAATTGTAAATTAAAATGTGCAACTTG 
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MSDI GDWFRS I PA I TRYWFAATVAVPLVGKLGL I S P AYLFLWPEAFLYRFQ I WRP I TATF YF 
PVGPGTGFLYLVNLYFLYQYSTRLETGAFDGRPADYLFMLLFNWICIVITGLAMDMQLLMIP 
LIMSVLYVWAQLNRDM I VSFWFGTRFKACYLPWVI LGFNY 1 1 GGSVINEL IGNLVGHLYFFL 
MFRYPMDI/^RNFLSTPQFLYRWLPSRRGGVSGFGVPPASMRRAADQNGGGGRHNWGQGFRLGDQ 
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FIGURE 5 

GGGGCCGCGGTCTAGGGCGGCTACGTGTGTTGCCATAGCGACCATTTTGCATTAACTGGTTG 

GTAGCTTCTATCCTGGGGGCTGAGCGACTGCGGGCCAGCTCTTCCCCTACTCCCTCTCGGCT 

CCTTGTGGCCCAAAGGCCTAACCGGGGTCCGGCGGTCTGGCCTAGGGATCTTCCCCGTTGCC 

CCTTTGGGGCGGGATGGCTGCGGAAGAAGAAGACGAGGTGGAGTGGGTAGTGGAGAGCATCG 

CGGGGTTCCTGCGAGGCCCAGACTGGTCCATCCCCATCTTGGACTTTGTGGAACAGAAATGT 

GAAGTTAACTGCAAAGGAGGGCATGTGATAACTCCAGGAAGCCCAGAGCCGGTGATTTTGGT 

GGCCTGTGTTCCCCTTGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGATTC 

ATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAATT 

AATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAGGC 

CATTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCCAGA 

AAAACATTGAAATGCAGCTGCAAGCCATTCGAATAATTCAAGAGAGAAATGGTGTATTACCT 

GACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAATCCT 

GAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAAGAAGAAAGGAAGAGGAAAA 

AACAGTTATCAGAGGCTAAAACAGAAGAGCCCACAGTGCATTCCAGTGAAGCTGCAATAATG 

AATAATTCCCAAGGGGATGGTGAACATTTTGCACACCCACCCTCAGAAGTTAAAATGCATTT 

TGCTAATCAGTCAATAGAACCTTTGGGAAGAAAAGTGGAAAGGTCTGAAACTTCCTCCCTCC 

CACAAAAAGGCCTGAAGATTCCTGGCTTAGAGCATGCGAGCATTGAAGGACCAATAGCAAAC 

TTATCAGTACTTGGAACAGAAGAACTTCGGCAACGAGAACACTATCTCAAGCAGAAGAGAGA 

TAAGTTGATGTCCATGAGAAAGGATATGAGGACTAAACAGATACAAAATATGGAGCAGAAAG 

GAAAACCCACTGGGGAGGTAGAGGAAATGACAGAGAAACCAGAAATGACAGCAGAGGAGAAG 

CAAACATTACTAAAGAGGAGATTGCTTGCAGAGAAACTCAAAGAAGAAGTTATTAATAAGTA 

ATAATTAAGAACAATTTAACAAAATGGAAGTTCAAATTGTCTTAAAAATAAATTATTTAGTC 

CTTACACTG 
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FTGTJRE6 

MAAEEEDEVEWVVESIAGFLRGPDWSIPILDFVEQKCEVNCKGGHVITPGSPEPVILVACVP 
LVFDDEEESKLTYTEIHQEYKELVEKLLEGYLKEIGINEDQFQEACTSPLAKTHTSQAILQP 

VLAAEDFT I FKAMMVQKN I EMQLQAI R 1 1 QERNGVL PDCLTDGSDWSDLEHEEMKI LREVL 
RKSKEEYDQEEERKRKKQLSEAKTEEPTVHSSEAAIMNNSQGDGEHFAHPPSEVKMHFANQS 

I EPLGRKVERSETS SLPQKGLK I PGLEHAS I EGP I ANL S VLGTEELRQREHYLKQKRDKLMS 
MRKDMRTKQ I QNMEQKGKPTGE VEEMTEKPEMT AEEKQTLLKRRLLAEKLKEEVI NK 
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FTGTJRE 7 

GGGCACAGCACATGTGAAGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGAT 
TCATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAA 
TTAATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAG 
GCCATTTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCC 
AGAAAAACATTGAAATGCAGCTGCAAGCCATTCGAATAATTCAAGAGAGAAATGGTGTATTA 
CCTGACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAAT 
CCTGAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAA 
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FIGURE 8 

GCGTGGTTTTTGTTCTGC^^TAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTG 
TAGCTTCTCCACGTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTG 
TCTCAGCTCTAGGATGTGCGTTCTTCO^CTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAAC 
AGTGGAAT^AAAAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATAC 
ATTCCTGCTAGGTGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATT 
CTGCCAATGAAGAAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTG 
GTGAAGCTAGTTTTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCTkAAGTAG 
AAATTTGAAATATGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCT 
TTCTTTATTTCCTGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATG 
GCTGTTATCTTCTCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAA 
GAGGCGTCTAAACTGGATCCAGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCT 
TGACTGCCGGGACTAAAACTTTACAGCACAACTTGGCAGGACGTGGATTTCATCACGATGCC 
TTTTTCAGCCCTTCCAATTCCTGCCTTCTTTTCAGAAGTGAGTGTCCCAGAAAAGACAATTG 
TACAGCAAAGGAATGGACTTTTCCTGAAGCTAAATGGAACACCACAGCCAGAGTTTTCAGTC 
ACATCCGTCTTGGCATGGGCCATGTTCTTATTATAGTCCAGTGTTTTATTTCTTCAATGGCT 
AATATCTATAATGAAAAGATACTGAAGGAGGGGAACCAGCTCACTGAAAGCATCTTCATACA 
GAACAGCAAACTCTATTTCTTTGGCATTCTGTTTAATGGGCTGACTCTGGGCCTTCAGAGGA 
GTAACCGTGATCAGATTAAGAACTGTGGATTTTTTTATGGCCACAGTGCATTTTCAGTAGCC 
CTTATTTTTGTAACTGCATTCCAGGGCCTTTCAGTGGCTTTCATTCTGAAGTTCCTGGATAA 
CATGTTCCATGTCTTGATGGCCCAGGTTACCACTGTCATTATCACAACAGTGTCTGTCCTGG 
TCTTTGACTTCAGGCCCTCCCTGGAATTTTTCTTGGAAGCCCCATCAGTCCTTCTCTCTATA 
TTTATTTATAATGCCAGCAAGCCTCAAGTTCCGGAATACGCACCTAGGCAAGAAAGGATCCG 
AGATCTAAGTGGCAATCTTTGGGAGCGTTCCAGTGGGGATGGAGAAGAACTAGAAAGACTTA 
CCAAACCGAAGAGTGATGAGTCAGATGAAGATACTTTCTAACTGGTACCCACATAGTTTGCA 
GCTCTCTTGAACCTTATTTTCACATTTTCAGTGTTTGTAATATTTATCTTTTCACTTTGATA 
AACCAGAAATGTTTCTAAATCCTAATATTCTTTGCATATATCTAGCTACTCCCTAAATGGTT 
CCATCCAAGGCTTAGAGTACCCAAAGGCTAAGAAATTCTAAAGAACTGATACAGGAGTAACA 
ATATGAAGAATTCATTAATATCTCAGTACTTGATAAATCAGAAAGTTATATGTGCAGATTAT 
TTTCCTTGGCCTTCAAGCTTCCAAAAAACTTGTAATAATCATGTTAGCTATAGCTTGTATAT 
ACACATAGAGATCAATTTGCCAAATATTCACAATCATGTAGTTCTAGTTTACATGCCAAAGT 
CTTCCCTTTTTAACATTATAAAAGCTAGGTTGTCTCTTGAATTTTGAGGCCCTAGAGATAGT 
CATTTTGCAAGTAAAGAGCAACGGGACCCTTTCTAAAAACGTTGGTTGAAGGACCTAAATAC 
CTGGCCATACCATAGATTTGGGATGATGTAGTCTGTGCTAAATATTTTGCTGAAGAAGCAGT 
TTCTCAGACACAACATCTCAGAATTTTAATTTTTAGAAATTCATGGGAAATTGGATTTTTGT 
AATAATCTTTTGATGTTTTAAACATTGGTTCCCTAGTCACCATAGTTACCACTTGTATTTTA 
AGTCATTTAAACAAGCCACGGTGGGGCTTTTTTCTCCTCAGTTTGAGGAGAAAAATCTTGAT 
GTCATTACTCCTGAATTATTACATTTTGGAGAATAAGAGGGCATTTTATTTTATTAGTTACT 
AATTCAAGCTGTGACTATTGTATATCTTTCCAAGAGTTGAAATGCTGGCTTCAGAATCATAC 
CAGATTGTCAGTGAAGCTGATGCCTAGGAACTTTTAAAGGGATCCTTTCAAAAGGATCACTT 
AGCAAACACATGTTGACTTTTAACTGATGTATGAATATTAATACTCTAAAAATAGAAAGACC 
AGTAATATATAAGTCACTTTACAGTGCTACTTCACACTTAAAAGTGCATGGTATTTTTCATG 
GTATTTTGCATGCAGCCAGTTAACTCTCGTAGATAGAGAAGTCAGGTGATAGATGATATTAA 
AAATTAGCAAACAAAAGTGACTTGCTCAGGGTCATGCAGCTGGGTGATGATAGAAGAGTGGG 
CTTTAACTGGCAGGCCTGTATGTTTAC^GACTACC^TACTGTAAATATGAGCTTTATGGTGT 
CATTCTCAGAAACTTATACATTTCTGCTCTCCTTTCTCCTAAGTTTCATGCAGATGAATATA 
AGGTAATATACTATTATATAATTCATTTGTGATATCCACAATAATATGACTGGCAAGAATTG 
GTGGAAATTTGTAATTAAAATAATTATTAAACCT 
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FIGURE 9 

MEKQCCSHPVICSLSTI^TFLLGAIFIALSSSRILLVKYSANEENKYDYLPTTVNVCSELVK 
LVFCtfLVSFCVIKKDHQSRNLKYASWKEF 

I FSNFS 1 1 TTALL FR I VL KRRLNW I QWAS LLTLFLS I VALTAGTKTLQHNLAGRGFHHDAFF 
SPSNSCLLFRSECPRKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVLIIVQCFISSMANI 
YNEKILKEGNQLTESIFIQNSKLYFFGILFNGLTLGLQRSNRDQIKNCGFFYGHSAFSVALI 
FVTAFQGLSVAFILKFLDNMFHVLMAQVTTVI ITTVSVLVFDFRPSLEFFLEAPSVLLSIFI 
YNASKPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTKPKSDESDEDTF 
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FIGURE 10 

CGTGCCTGCGCAATGGGTGTCGGGTCCGCTTTTTCCCAATCCGGACGTAATCGTGGTTTTTG 
TTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTGTAGCTTCTCCAC 
GTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTGTCTCAGCTCTAG 
GATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAACAGTGGAATGGAA 
AAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATACATTCCTGCTAGG 
TGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATTCTGCCAATGAAG 
AAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTGGTGAAGCTAGTT 
TTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAGAT^ATTTGAAATA 
TGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCTTTCTTTATTTCC 
TGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATGGCTGTTATCTTC 
TCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAAGAGGCGTCTAAA 
CTGGATCCAGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCTTGACTGCCGGGA 
CTAAAACTTTA 
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FIGURE 11 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGGCCGGCTTGGCTAGCGCGCGGCGGCC 
GTGGCTAAGGCTGCTACGAAGCGAGCTTGGGAGGAGCAGCGGCCTGCGGGGCAGAGGAGCAT 
CCCGTCTACCAGGTCCCAAGCGGCGTGGCCCGCGGGTCATGGCCAAAGGAGAAGGCGCCGAG 
AGCGGCTCCGCGGCGGGGCTGCTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 
GGTGAAGAAAGAACCGAAAAAGAAGAAACAACAGTTGTCTGTTTGCAACAAGCTTTGCTATG 
CACTTGGGGG AG CC CC CT AC CAGGTG ACGGGCTG TG CC CTGGGTTTCTTCCTTCAG ATCTAC 
CTATTGGATGTGGCTCAGGTGGGCCCTTTCTCTGCCTCCATCATCCTGTTTGTGGGCCGAGC 
CTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGCAAATCCCCCTGGACCTGCC 
TGGGTCGCCTTATGCCCTGGATCATCTTCTCCACGCCCCTGGCCGTCATTGCCTACTTCCTC 
ATCTGGTTCGTGCCCGACTTCCCACACGGCCAGACCTATTGGTACCTGCTTTTCTATTGCCT 
CTTTGAAACAATGGTCACGTGTTTCCATGTTCCCTACTCGGCTCTCACCATGTTCATCAGCA 
ACCGAGCAGACTGAGCGGGATTCTGCCACCGCCTATCGGATGACTGTGGAAGTGCTGGGCAC 
AGTGCTGGGCACGGCGATCCAGGGACAAATCGTGGGCCAAGCAGACACGCCTTGTTTCCAGG 
ACTTCAATAGCTCTACAGTAGCTTCACAAAGTGCCAACCATACACATGGCACCACTTCACAC 
AGGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTG 
TGCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTG 
AGCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTT 
ATTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTT 
TTGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCT 
CGGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCT 
GTATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAA 
CCTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTAC 
TACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCAT 
GGAACCGAGCCCATCTTCTTCTCCTTCTATGTCTTCTTCACCAAGTTTGCCTCTGGAGTGTC 
ACTGGGCATTTCTACCCTCAGTCTGGACTTTGCAGGGTACCAGACCCGTGGCTGCTCGCAGC 
CGGAACGTGTCAAGTTTACACTGAACATGCTCGTGACCATGGCTCCCATAGTTCTCATCCTG 
CTGGGCCTGCTGCTCTTCAAAATGTACCCCATTGATGAGGAGAGGCGGCGGCAGAATAAGAA 
GGCCCTGCAGGCACTGAGGGACGAGGCCAGCAGCTCTGGCTGCTCAGAAACAGACTCCACAG 
AGCTGGCTAGCATCCTCTAGGGCCCGCCACGTTGCCCGAAGCCACCATGCAGAAGGCCACAG 
AAGGGATCAGGACCTGTCTGCCGGCTTGCTGAGCAGCTGGACTGCAGGTGCTAGGAAGGGAA 
CTGAAGACTCAAGGAGGTGGCCCAGGACACTTGCTGTGCTCACTGTGGGGCCGGCTGCTCTG 
TGGCCTCCTGCCTCCCCTCTGCCTGCCTGTGGGGCCAAGCCCTGGGGCTGCCACTGTGAATA 
TGCCAAGGACTGATCGGGCCTAGCCCGGAACACTAATGTAGAAACCTTTTTTTTACAGAGCC 
TAATTAATAACTTAATGACTGTGTACATAGCAATGTGTGTGTATGTATATGTCTGTGAGCTA 
TTAATGTTATTAATTTTCATAAAAGCTGGAAAGC 
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FIGURE 12 

MWLRWALSLPPSSCLWAEPGMPSQTPWWASASANPPGPAWVALCPGSSSPRPWPSLPTSSSG 
SCPTSHTARPIGTCFSIASLKQWSRVSMFPTRLSPCSSATEQTERDSATAYRMTVEVLGTVL 
GTAIQGQIVGQADTPCFQDFNSSTVASQSANHTHGTTSHRETQKAYLLAAGVIVCIYIICAV 
IL I LGVREQREP YEAQQS EP I AYFRGLRLVMSHGPY I KL I TGFLFTSLAFMLVEGNFVLFCT 
YTLGFRNEFQNLLLAIMLSATLTIPIWQ 

ITYAVAVAAGISVAAAFLLPWSMLPDVIDDFHLKQPHFHGTEPIFFSFYVFFTKFASGVSLG 
I S TLSLDFAG YQTRGCSQ PERVKFTLNMLVTMAP I VL I LLGLLLFKMYP I DEERRRQNKKAL 
QALRDEASSSGCSETDSTELASIL 
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FIGURE 13 

GGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTGT 
GCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTGA 
GCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTTA 
TTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTTT 
TGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCTC 
GGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCTG 
TATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAAC 
CTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTACT 
ACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCATG 
GAACCGAGCCCAT 
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FIGURE 14 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGT 
ATGAGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGT 
GGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCT 
TTCATATTTTCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATAT 
CAGTGACACTGGTACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGG 
CAGTTTTATGCATTGCTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAA 
GAGAACGTTATCATCAAATTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGG 
ACTTTCTATTGTGGCAAACTTCCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTG 
TGCTTACCTTTGGTATGGGCTCATTATATATGTTTGTTCAGACCATCCTTTCCTACCAAATG 
CAGCCCAAAATCCATGGCAAACAAGTCTTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGG 
AGTAAGTGCACTTAGCATGCTGACTTGCTCATCAGTTTTGCACAGTGGCAATTTTGGGACTG 
ATTTAGAACAGAAACTCCATTGGAACCCCGAGGACAAAGGTTATGTGCTTCACATGATCACT 
ACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTTGGTTTTTTCCTGACTTACATTCGTGA 
TTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACATGGATTAACCCTCTATGACACTG 
CACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCAGAGATATTTGATGAAAGGAT 
AAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGGTTCACAGAAGTTGCTTA 
TTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACTGATGAATGCTGATA 
ATCAGGAAACATGAAAGAAG CC AT TTGATAGATT ATTCT AAAGG AT AT C ATCAAG AAGACTA 
TTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACTATG 
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FIGURE 15 

MWWFQQGLSFLPSALVIVH^SAAFI^ 

AAVLC I AT I YVRYKQVHALS PEENV 1 1 KLNKAGLVLG I LS CLGLS I VANFQKTTLF AAHVSG 
AVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFG 
TDLEQKLHWNPEDKG YVLHM I TTAAE WSMS FS FFGF FLTY I RDFQK I S LRVEANLHGLTL YD 
TAPCPINNERTRLLSRDI 
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FIGURE 16 

CGGACGCTTGGGCNGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGTGCCTGATGCCGAGT 
TCCGTCTCTCGGGTCTTTTCCTGGTCCCAGGCAAAGCGGAGCGGAGATCCTCAAACGGCCTA 
GTGCTTCGCGCTTCCGGAGAAAATCAGCGGTCTAATTAATTCCTCTGGTTTGTTGAAGCAGT 
TACC^GAATCTTCAACCCTTTCCCACAAAAGCTAATTGAGTACACGTTCCTGTTGAGTACA 
CGTTCCTGTTGATTTACAAAAGGTGCAGGTATGAGCAGGTCTGAAGACTAACATTTTGTGAA 
GTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCT 
TCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTTTCATACATTACTGCAGTAACACT 
CCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGGTACAGTANC 
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FIGURE 17 

CCCACGCGTCCGCCCGCCGCTGCGTCCCGGAGTGCAAGTGAGCTTCTCGGCTGCCCCGCGGG 
CCGGGGTGCGGAGCCGACATGCGCCCGCTTCTCGGCCTCCTTCTGGTCTTCGCCGGCTGCAC 
CTTCGCCTTGTACTTGCTGTCGACGCGACTGCCCCGCGGGCGGAGACTGGGCTCCACCGAGG 
AGGCTGGAGGCAGGTCGCTGTGGTTCCCCTCCGACCTGGCAGAGCTGCGGGAGCTCTCTGAG 
GTCCTTCGAGAGTACCGGAAGGAGCACCAGGCCTACGTGTTCCTGCTCTTCTGCGGCGCCTA 
CCTCTACAAACAGGGCTTTGCCATCCCCGGCTCCAGCTTCCTGAATGTTTTAGCTGGTGCCT 
TGTTTGGGCCATGGCTGGGGCTTCTGCTGTGCTGTGTGTTGACCTCGGTGGGTGCCACATGC 
TGCTACCTGCTCTCCAGTATTTTTGGCAAACAGTTGGTGGTGTCCTACTTTCCTGATAAAGT 
GGCCCTGCTGCAGAGAAAGGTGGAGGAGAACAGAAACAGCTTGTTTTTTTTCTTATTGTTTT 
TGAGACTTTTCCCCATGACACCAAACTGGTTCTTGAACCTCTCGGCCCCAATTCTGAACATT 
CCCATCGTGCAGTTCTTCTTCTCAGTTCTTATCGGTTTGATCCCATATAATTTCATCTGTGT 
GCAGACAGGGTCCATCCTGTCAACCCTAACCTCTCTGGATGCTCTTTTCTCCTGGGACACTG 
TCTTTAAGCTGTTGGCCATTGCCATGGTGGCATTAATTCCTGGAACCCTCATTAAAAAATTT 
AGTCAGAAACATCTGCAATTGAATGAAACAAGTACTGCTAATCATATACACAGTAGAAAAGA 
CACATGATCTGGATTTTCTGTTTGCCACATCCCTGGACTCAGTTGCTTATTTGTGTAATGGA 
TGTGGTCCTCTAAAGCCCCTCATTGTTTTTGATTGCCTTCTATAGGTGATGTGGACACTGTG 
CATCAATGTGCAGTGTCTTTTCAGAAAGGACACTCTGCTCTTGAAGGTGTATTACATCAGGT 
TTTCAAACCAGCCCTGGTGTAGCAGACACTGCAACAGATGCCTCCTAGAAAATGCTGTTTGT 
GGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCCGGTGATTC 
ACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAAGATGGTGAAATCCTGTCTCTAATAAAAAT 
ACAAAAATTAGCCAGGCGTGGTGGCAGGCACCTGTAATCCCAGCTACTCGGGAGGCTGAGGC 
AGGAGAATTGCTTGAACCAAGGTGGCAGAGGTTGCAGTAAGCCAAGATCACACCACTGCACT 
CCAGCCTGGGTGATAGAGTGAGACACTGTCTTGAC 
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FIGURE 18 

MRPLLGI^LVFAGCTFALYLLSTRLPRG^^ 

KEHQAYVFLLFCGAYLYKQGFAIPGSSFLNVLAGALFGPWLGLLLCCVLTSVGATC 
IFGKQLWSYFPDKVALLQRKVEENRNSLFFFLLFLRLFPMTPIWFIJTLSAPILNIPIVQFF 
FSVL IGLI PYNF I CVQTGS I LSTLTSLDALFSWDTVFKLLAI AMVALI PGTL I KKFSQKHLQ 
LNETSTANHIHSRKDT 
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FIGURE 19 

CCGAGGCGGGAGGAGCCCGAGGGGGCGCGAGCCCCGCATGAATCATTGTAGTCAATCATTTT 
CCAGTTCTCAGCCGCTCAGTTGTGATCAAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAA 
TAGGAAAATAACTTGGGATTTTATATTGGAAGACATTCATCTTGCTGCCAACGAGATCAGCA 
TTTATGACAAACTTTCAGAGACTGTTGATTTGGTGAGACAGACCGGCCATCAGTGTGGCATG 
TCAGAGAAGGCAATTGAAAAATTTATCAGACAGCTGCTGGAAAAGAATGAACCTCAGAGACC 
CCCCCCGCAGTATCCTCTCCTTATAGTTGTGTATAAGGTTCTCGCAACCTTGGGATTAATCT 
TGCTCACTGCCTACTTTGTGATTCAACCTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTCT 
GGAGCTCACACCTGGCGCTCACTCATCCATCACATTAGGCTGATGTCCTTGCCCATTGCCAA 
GAAGTACATGTCAGAAAATAAGGGAGTTCCTCTGCATGGGGGTGATGAAGACAGACCCTTTC 
CAGACTTTGACCCCTGGTGGACAAACGACTGTGAGCAGAATGAGTCAGAGCCCATTCCTGCC 
AACTGCACTGGCTGTGCCCAGAAACACCTGAAGGTGATGCTCCTGGAAGACGCCCCAAGGAA 
ATTTGAGAGGCTCCATCCACTGGTGATCAAGACGGGAAAGCCCCTGTTGGAGGAAGAGATTC 
AGCATTTTTTGTGCCAGTACCCTGAGGCGACAGAAGGCTTCTCTGAAGGGTTTTTCGCCAAG 
TGGTGGCGCTGCTTTCCTGAGCGGTGGTTCCCATTTCCTTATCCATGGAGGAGACCTCTGAA 
CAGATCACAAATGTTACGTGAGCTTTTTCCTGTTTTCACTCACCTGCCATTTCCAAAAGATG 
CCTCTTTAAACAAGTGCTCCTTTCTTCACCCAGAACCTGTTGTGGGGAGTAAGATGCATAAG 
ATGCCTGACCTATTTATCATTGGCAGCGGTGAGGCCATGTTGCAGCTCATCCCTCCCTTCCA 
GTGCCGAAGACATTGTCAGTCTGTGGCCATGCCAATAGAGCCAGGGGATATCGGCTATGTCG 
ACACCACCCACTGGAAGGTGTACGTTATAGCCAGAGGGGTCCAGCCTTTGGTCATCTGCGAT 
GG AACCGCTTTCTCAGAACTGTAGGAAATAGAACTGTGCACAGGAACAGCTTCCAGAGCCGA 
AAACCAGGTTGAAAGGGGAAAAATAAAAACAAAAACGATGAAACTGCAAAAA 
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FIGURE 20 

MDLAANE I S I YDKLS ETVDL VRQTGHQCGMS E KA I E KF I RQLLE KNE PQR PP PQ YPLL I WY 
KVLATLGL I LLTAYFVI QPFS PLAPE PVLSGAHTWRS L IHHI RLMSLP I AKKYMSENKGVPL 
HGGDEDRPFPDFDPWWTNDCEQNESEPIPANCTGCAQKH^ 

GKPLLEEEIQHFLCQYPEATEGFSEGFFAKWWRCFPERWFPFPYPWRRPLNRSQMLRELFPV 
FTHLPFPKDASLNKCSFLHPEPWGSKMHKMPDLFI IGSGEAMLQLIPPFQCRRHCQSVAMP 
I E PGD I GYVDTTHWKVYV I ARG VQPLVI CDGTAFS EL 
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FIGURE 21 

CCACGGTGTCCGTTCTTCGCCCGGCGGCAGCTGTCCCCGAGGCGGGAGGAGCCCGAGGGGCG 
CGAGCCCCGCATGAATCATTGTAGTCAATCATTTTCCAGTTCTCAGCCGTTCAGTTGTGATC 
AAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAATAGGAAAATAACTTGGGATTTTATATT 
GGAAGACATGGATCTTGCTGCCAACGAGATCAGCATTTATGACAAACTTTCAGAGACTGTTG 
ATTTGGTGAGACAGACCGGCCATCAGTGTGGCATGTCAGAGAAGGCAATTGAAAAATTTATC 
AGACAGCTGCTGGAAAAGAATGAACCTCAGAGACCCCCCCCGCAGTATCCTCTCCTTATAGT 
TGTGTATAAGGTTCTCGCAACCTTGGGATTAATCTTGCTCACTGCCTACTTTGTGATTCAAC 
CTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTGTGGAGCTCAC 
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FIGURE 22 

CCCACGCGTCCGCCCACGCGTCCGGCTGAACACCTCTTCTTTGGAGTCAGCCACTGATGAGG 

CAGGGTCCCCACTTGCAGCTGCAGCAGCTGCAGCAGCTGCAGAGCGCTGCTCCTGGCTGGTG 

CCACTGGTGCGCACGCTGCTAGACCGTGCCTATGAGCCGCTGGGGCTGCAGTGGGGACTGCC 

CTCCCTGCCACCCACCAATGGCAGCCCCACCTTCTTTGAAGACTTCCAGGCTTTTTGTGCCA 

CACCCGAATGGCGCCACTTCATCGACAAACAGGTAC^GCCAACCATGTCCCAGTTCGAAATG 

GACACGTATGCTAAGAGCCACGACCTTATGTCAGGTTTCTGGAATGCCTGCTATGACATGCT 

TATGAGCAGTGGGCAGCGGCGCCAGTGGGAGCGCGCCCAGAGTCGTCGGGCCTTCCAGGAGC 

TGGTGCTGGAACCTGCGCAGAGGCGGGCGCGCCTGGAGGGGCTACGCTACACGGCAGTGCTG 

AAGCAGCAGGCAACGCAGCACTCCATGGCCCTGCTGCACTGGGGGGCGCTGTGGCGCCAGCT 

CGCCAGCCCATGTGGGGCCTGGGCGCTGAGGGACACTCCCATCCCCCGCTGGAAACTGTCCA 

GCGCCGAGACATATTCACGCATGCGTCTGAAGCTGGTGCCCAACCATCACTTCGACCCTCAC 

CTGGAAGCCAGCGCTCTCCGAGACAATCTGGGTGAGGTTCCCCTGACACCCACCGAGGAGGC 

CTCACTGCCTCTGGCAGTGACCAAAGAGGCCAAAGTGAGCACCCCACCCGAGTTGCTGCAGG 

AGGACCAGCTCGGCGAGGACGAGCTGGCTGAGCTGGAGACCCCGATGGAGGCAGCAGAACTG 

GATGAGCAGCGTGAGAAGCTGGTGCTGTCGGCCGAGTGCCAGCTGGTGACGGTAGTGGCCGT 

GGTCCCAGGGCTGCTGGAGGTCACCACACAGAATGTATACTTCTACGATGGCAGCACTGAGC 

GCGTGGAAACCGAGGAGGGCATCGGCTATGATTTCCGGCGCCCACTGGCCCAGCTGCGTGAG 

GTCCACCTGCGGCGTTTCAACCTGCGCCGTTCAGCACTTGAGCTCTTCTTTATCGATCAGGC 

CAACTACTTCCTCAACTTCCCATGCAAGGTGGGCACGACCCCAGTCTCATCTCCTAGCCAGA 

CTCCGAGACCCCAGCCTGGCCCCATCCCACCCCATACCCAGGTACGGAACCAGGTGTACTCG 

TGGCTCCTGCGCCTACGGCCCCCCTCTCAAGGCTACCTAAGCAGCCGCTCCCCCCAGGAGAT 

GCTGCGTGCCTCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTCGAGTACT 

TGATGCAACTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTC 

CCCTGGGTCCTGCAGGACTACGTGTCCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCG 

GGACCTGTCTAAGCCCATCGGTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGT 

ATGAAAGCTTTGAGGACCCAGCAGGGACCATTGACAAGTTCCACTATGGCACCCACTACTCC 

AATGCAGCAGGCGTGATGCACTACCTCATCCGCGTGGAGCCCTTCACCTCCCTGCACGTCCA 

GCTGCAAAGTGGCCGCTTTGACTGCTCCGACCGGCAGTTCCACTCGGTGGCGGCAGCCTGGC 

AGGCACGCCTGGAGAGCCCTGCCGATGTGAAGGAGCTCATCCCGGAATTCTTCTACTTTCCT 

GACTTCCTGGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGT 

AGGCGATGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCC 

AGGCTCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGC 

TACAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 

GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTA 

TCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCA 

GCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTTCCAGCA 

CCTGGACGAACTCAAGGCATTCTTCGCAGAGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCA 

CCCACAGCTGGTTGCCCTATGACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCC 

ACCATGGGCAGCCACAAGACGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGT 

GAGTGGACAAGCACTGGCAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGG 

ATGGCAGCCTGCGGGTGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCAC 

CTTGATGTAGTAACCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCG 

GGACACCACGTGCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCAC 

CAAAGCCTGTGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACT 

GAACTTGACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCG 

CGGACAGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACC 

TGGCATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCC 

CAGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCCT 

GGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCCAGT 

GCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCATGAAG 

GTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCCTGGAGGA 

TGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCCAGTTCGCGC 

GGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGACGGAATACAAC 

CCTACTGAGGCGCGCTGAACCTGGCCAGTCCGGCTGCTCGGGCCCCGCCCCCGGCAGGCCTG 

GCCCGGGAGGCCCCGCCCAGAAGTCGGCGGGAACACCCCGGGGTGGGCAGCCCAGGGGGTGA 

GCGGGGCCCACCCTGCCCAGCTCAGGGATTGGCGGGCGATGTTACCCCCTCAGGGATTGGCG 

GGCGGAAGTCCCGCCCCTCGCCGGCTGAGGGGCCGCCCTGAGGGCCAGCACTGGCGTCT 
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FIGURE 23 

MSQFEMDTYAKSHDLMSGFWNACYD^ 
RYTAVLKQQATQHSMALLHWGAL^^ 

HHFDPHLEASALRDNLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDEI^ 
MEAAELDEQREKLVLSAECQLVTWAWPGLLEVTTQNVYFYDGSTERVETEEGIGYDFRRP 
LAQLREVHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQPGPIPPHTQV 
RNQWSWLLRLRPPSQGYLSSRSPQEMLRASGLTQKWVQREISNFEYLMQLNTIAGRTYNDL 
SQYPWPVWLQDYVSPTLDLSNPAVFRDLSKPIGVVNPKHAQLVREKYESFEDPAGTIDKFH 
YGTHYSNAAG VMHYL I RVE P FTSLHVQLQSGRFDCSDRQFHS VAAAWQARLE S PAD VKEL I P 
EFFYFPDFLENQNGFDLGCLQLTNEKVGDWLPPWASSPEDFIQQHRQALESEYVSAHLHEW 
I DL I FG YKQRGPAAEEALNVF YYCTYEGAVDLDHVTDERERKALEG 1 1 SNFGQT PCQLLKE P 
HPTRLSAEEAAHRLARLDTNSPS IFQH^^ 

SFSKDPTMGSHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLL 
SQLSCHLDVVTCLALDTCGIYLISGSRDTTCMVWRLLHQGGLSVGLAPKPVQVLYGHGAAVS 
CVAISTELDMAVSGSEDGTVIIHTVRRGQFVAALRPLGATFPGPIFHLALGSEGQIVVQSSA 
WERPGAQVTYSLHLYSVNGKLRASLPLAEQPTALTVTEDFVLLGTAQCALHILQLNTLLPAA 
PPLPMKVAIRSVAVTKERSHVLVGLEDGKLIVWAGQPSEVRSSQFARKLWRSSRRISQVSS 
GETEYNPTEAR 
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FIGURE 24 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCC^TAAATACATCATGCAACCC 
CACGGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCAT 
CCAAAGGCCTAATCCAACGTTCTGTCTTCAATCTGC/^AATCTATGGGGTCCTGGGGCTCTTC 
TGGACCCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTT 
CTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCC 
GCACACTCCGTTACCACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAG 
ATAGCCCGGGTCATCTTGGAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGC 
CCGCTGCATCATGTGCTGTTTCAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCC 
TAAACCGCAATGCATACATCATGATCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAA 
AATGCGTTCATGCTACTCATGCGAAACATTGTCAGGGTGGTCGTCCTGGACAAAGTGACAGA 
CCTGCTGCTGTTCTTTGGGAAGCTGCTGGTGGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTT 
TTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAGACTTTAAGAGCCCCCACCTCAACTATTAC 
TGGCTGCCCATCATGACCTCCATCCTGGGGGCCTATGTCATCGCCAGCGGCTTCTTCAGCGT 
TTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCTGGAAGACCTGGAGCGGAACAACG 
GCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAAAGATTCTGGGCAAGAAGAAC 
GAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGGCCCTGATCCAGGACTGC 
ACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGTCTCCATTTTGTGGT 
AAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACTTTGAGAGGCTG 
AGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCTCC 
GTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCAGCTAC 
TCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGA 
TCGCGCGACTGCACTCCAACCTGGGTGACAGACTCTGTCTCGAAAACAAAACAAACAAACAA 
AAAGATTTTATTAAAGATATTTTGTTAACTC 
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FIGURE 25 

RTRGRTRGGCEKVP INTS CNPTAHLVNS SCPGLM CVFQG YSS KGL I QRSVFNLQ I YGVLGLP 

WTLNWVLALGQCVLAGAFASFYWAFHKPQD I PTF PL I S AF I RTLRYHTGS LAFGAL I LTLVQ 

IARVILEYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAK 

NAFMLLMRNIWVVVLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYY 

WLPIOTSILGAYVIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKN 
EAPPDNKKRKK 
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FIGURE 26 

GAGTCTTGACCGCCGCCGGGCTCTTGGTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGT 
GGCTATGTTCGTGTCCGATTTCCGCAAAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCC 
TTCTCTTCGTGGCCTCGGACGTGGATGCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTC 
CAGTGTGACCACGTGCAATATACGCTGGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGC 
ATTTCTTGAGCATAAAGAACAGTTTCATTATTTTATTCTCATAAACTGTGGAGCTAATGTAG 
ACCTATTGGATATTCTTCAACCTGATGAAGACACTATATTCTTTGTGTGTGACTCCGATAGG 
CCAGTCAATGTCGTCAATGTATACAACGATACCCAGATCAAATTACTCATTAAACAAGATGA 
TGACCTTGAAGTTCCCGCCTATGAAGACATCTTCAGGGATGAAGAGGAGGATGAAGAGCATT 
CAGGAAATGACAGTGATGGGTCAGAGCCTTCTGAGAAGCGCACACGGTTAGAAGAGGAGATA 
GTGGAGCAAACCATGCGGAGGAGGCAGCGGCGAGAGTGGGAGGCCCGGAGAAGAGACATCCT 
CTTTGACTACGAGCAGTATGAATATCATGGGACATCGTCAGCCATGGTGATGTTTGAGCTGG 
CTTGGATGCTGTCCAAGGACCTGAATGACATGCTGTGGTGGGCCATCGTTGGACTAACAGAC 
CAGTGGGTGCAAGACAAGATCACTCAAATGAAATACGTGACTGATGTTGGTGTCCTGCAGCG 
CCACGTTTCCCGCCACAACCACCGGAACGAGGATGAGGAGAACACACTCTCCGTGGACTGCA 
CACGGATCTCCTTTGAGTATGACCTCCGCCTGGTGCTCTACCAGCACTGGTCCCTCCATGAC 
AGCCTGTGCAACACCAGCTATACCGCAGCCAGGTTCAAGCTGTGGTCTGTGCATGGACAGAA 
GCGGCTCCAGGAGTTCCTTGCAGACATGGGTCTTCCCCTGAAGCAGGTGAAGCAGAAGTTCC 
AGGCCATGGACATCTCCTTGAAGGAGAATTTGCGGGAAATGATTGAAGAGTCTGCAAATAAA 
TTTGGGATGAAGGACATGCGCGTGCAGACTTTCAGCATTCATTTTGGGTTCAAGCACAAGTT 
TCTGGCCAGCGACGTGGTCTTTGCCACCATGTCTTTGATGGAGAGCCCCGAGAAGGATGGCT 
CAGGGACAGATCACTTCATCCAGGCTCTGGACAGCCTCTCCAGGAGTAACCTGGACAAGCTG 
TACCATGGCCTGGAACTCGCCAAGAAGCAGCTGCGAGCCACCCAGCAGACCATTGCCAGCTGC 
CTTTGCACCAACCTCGTCATCTCCCAGGGGCCTTTCCTGTACTGCTCTCTCATGGAGGGCAC 
TCCAGATGTCATGCTGTTCTCTAGGCCGGCATCCCTAAGCCTGCTCAGCAAACACCTGCTCA 
AGTCCTTTGTGTGTTCGACAAAGAACCGGCGCTGCAAACTGCTGCCCCTGGTGATGGCTGCC 
CCCCTGAGCATGGAGCATGGCACAGTGACCGTGGTGGGCATCCCCCCAGAGACCGACAGCTC 
GGACAGGAAGAACTTTTTTGGGAGGGCGTTTGAGAAGGCAGCGGAAAGCACCAGCTCCCGGA 
TGCTGCACAACCATTTTGACCTCTCAGTAATTGAGCTGAAAGCTGAGGATCGGAGCAAGTTT 
CTGGACGCACTTATTTCCCTCCTGTCCTA^AATTTGATTCTTCCAGAATGACCTTCTTATT 
TATGTAACTGGCTTTCATTTAGATTGTAAGTTATGGACATGATTTGAGATGTAGAAGCCATT 
TTTTATTAAATAAAATGCTTATTTTAGGAAA 
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FIGURE 27 

MFVSDFRKEFTEWQSQRVLLFVASDVDALCACKILQ7VLFQCDHVQYTLVPVSGWQELETAF 

LEHKEQFHYFILINCGANVDLLDILQPDEDTIFFVCDSHRPVNVVNVYNDTQIK^ 

LEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLEEEIVEQTMRRRQRREWEARRRDILF 

D YEQ YE YHGTS S AMVMFE LAWMLS KDLNDMLWWA I VGLTD QWVQDK I TQMKYVTD VG VLQRH 

VSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCNTSYTAARFKLW 

LQEFLADMGLPLKQVKQKFQAMDI SLKENLREMI EE SANKFGMKDMRVQTFS IHFGFKHKFL 

ASDVVFATMSLMESPEKDGSGTDHFIQALDSLSRSN^^ 

CTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAP 

LSMEHGTVTWGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIE 

DALISLLS 
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FIGURE 28 

GTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGTGGCTATGNTCGTGTCCGATTTCCGCA 
AAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCCTTCTCTTCGTGGCCTCGGANGTGGAT 
GCTCTGTGTGCGTGGAAGATCCTTCAGGCCTTGTTCGAGTGTGACCANGTGCAATATANGCT 
GGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGCATTTCTTGAGCATAAAGAACAGTTTC 
ATTATTTTATTCTCATAAACTGTGGAGCTAATGTAGACCTATTGGATATTCTTCAACCTGAT 
GAAGACACTATATTCTTTGTGTGTGACACCCATAGGCCAGTCAATGTTGTCAATGTATACAA 
CGATACCC 
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FIGURE 29 

CAGGAACCCTCTCTTTGGGTCTGGATTGGGACCCCTTTCCAGTACCATTTTTTCTAGTGAAC 
CACGAAGGGACGAT AC CAGAAAAC AC C CTC AACC CAAAGGAAAT AG ACTACAGC CCCAATTG 
rCTGACTTTGGCTATAGAAAAAAGAAAGGAACGAAAAGAGACAGTTTTTTTTGGAAAGCTAA 
GTCTTCCCTTTATCGAGTCAAGAAACCCCCCCTTCTTGAGCTATTTACAGCTTTTAACAATT 
GAGTAAAGTACGCTCCGGTCACCATGGTGACAGCCGCCCTGGGTCCCGTCTGGGCAGCGCTC 
^GCTCTTTCTCCTGATGTGTGAGATCCGTATGGTGGAGCTCACCTTTGACAGAGCTGTGGC 
CAGCGGCTGCCAACGGTGCTGTGACTCTGAGGACCCCCTGGATCCTGCCCATGTATCCTCAG 
CCTCTTCCTCCGGCCGCCCCCACGCCCTGCCTGAGATCAGACCCTACATTAATATCACCATC 
CTGAAGGGTGACAAAGGGGACCCAGGCCCAATGGGCCTGCCAGGGTACATGGGCAGGGAGGG 
TCCCCAAGGGGAGCCTGGCCCTCAGGGCAGCAAGGGTGACAAGGGGGAGATGGGCAGCCCCG 
GCGCCCCGTGCCAGAAGCGCTTCTTCGCCTTCTCAGTGGGCCGCAAGACGGCCCTGCACAGC 
GGCGAGGACTTCC^GACGCTGCTCTTCGAAAGGGTCTTTGTGAACCTTGATGGGTGCTTTGA 
CATGGCGACCGGCCAGTTTGCTGCTCCCCTGCGTGGCATCTACTTCTTC^GCCTCIAATGTGC 
ACAGCTGGAATTACAAGGAGACGTACGTGCACATTATGCATAAC CAGAAAGAGG CTGTCATC 
CTGTACGCGCAGCCCAGCGAGCGC^GCATCATC 

CTACGGGGACCGCGTCTGGGTGCGGCTCTTCAAGCGCCAGCGCGAGAACGCCATCTACAGCA 
ACGACTTCGACACCTACATCACCTTCAGCGGCCACCTCATCAAGGCCGAGGACGACTGAGGG 
CCTCTGGGCCACCCTCCCGGCTGGAGAGCTCAGGTGCTGGTCCCGTCCCCTGCAGGGCTCAG 
TTTGCACTGCTGTGAAGCAGGAAGGCCAGGGAGGTCCCCGGGGACCTGGCATTCTGGGGAGA 
CCCTGCTTCTATCTTGGCTGCCATCATCCCTCCCAGCCTATTTCTGCTCCTCTCTTCTCTCT 
TGGACCTATTTTAAGAAGCTTGCTAACCTAAATATTCTAGAACTTTCCCAGCCTCGTAGCCC 
AGCACTTCTCAAACTTGGAAATGCATGCGAATCACCCGGGGTTCGTGTTAAATGCAGATTCT 
GACTCAGCAGGTCTGAGTGGGTCCAGGATTCTGTGTTTCTCATATGTTCCTGGGTGATGCTG 
ATGGGGTCAGTCTATGAACCACACTGGAGCAACCAGGTTCTAGGACTTTCTCAATATTCTAG 
TACTTTCTGAACATTCTGGAATCCTCCCCACATTCTAGAATTCTCCCAACATTTTTTTTTCT 
TGAGACAGAGTCTTGCTCTGTTGCCCAGGCTAGAGTGCAGTGGTGCAATCTCAGTTCACTGC 

aIcctctgcctcccgggttcaagcgattcttctgcctcagcctccctagtggctgggato^ 

AGGCGCCTGCTACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATA 

TTGGCCAGGCTGGTCTTGAACTCCTGACTTCAGGTGACCCACCCGCCTCGGCCTCTCAAA^ 

GCTGGGATTACAGGTGTGAGCCACCGTGCCTGGCCAATTCCAACATTCTTAAATTCTCTCAT 

CCC?C?AGGGCTCCCCG^ 

TGCACCACTGCAGCC^CCGTTCATTTATTCATTCATTAAACACTGAGCACT 

GGG^CCCGGGAAGGGTGAGGGGGTCAGACACAGGCCCTGCCCCTGCCCT 

GTCCAGCCCAGGCGGGGAGAGATGTGTACATAGGTTTTAAAGC^ 

GCCTGTGTTCTGGGTGTTCAGGTGCTGCTGGTCCTCCATTACCCACTGCTCCCCAAGGCTGG 

TGGGACGGGGTCCCGGTGGCAGGGGCAGGTATCTCCTTCCCGTTCCTCATCCACCTGCCCAG 

TGCTC^A^GTTACAGCAAACCCCAGGGGGCCTTGGCCAGGTCAAGGGTTCTGTGAGGAGAGG 

ACCCAGGAGTGTGGGGGCATTTGGGGGGTGAAGTGGCCCCCGAAGAATGGAACCCACACCCA 

TAGCTCTCCCC^CAGCTGATACGGCATCCTGCGAGAAGACCTGCCCTCCTCACTGGGA^ 

CTTCCTGCCTCCTCCCAGGGCTCTGCCAC^CCTTGCTC^GTCCCTTCCACa^GTCATCT 

GAACTTCCGTTTCCCCAGGGCCTCCAGCTGCCCTCAGACACTGATGTCTGTCCCCAGGTGCT 

CTCTGCCCCTCATGCCCCTCTCACCGGCCCAGTGCCCCGACTCTCCAGGCTTTATCAAGGTG 

ctaIggcccgggtgggcagctcctcgtctcagagccctcctccggcctggtgctgcctttac 

AAACACCTGCAGGAGAAGGGCCACGGAAGCCCCAGGCTTTAGAGCCCTCAGCAGGTCTGGGG 

aIStagagca^ 

GTTCCCCTAGCCTTCCAAACCCAGGTGGCCTC 

CCATTGGTGCTCATGCAGACTCT(K3GGCTGAGGTGCCCCGGGGGGTGATCTCTGGTGCTCA^ 
AGCCGAGGGAGCCGTGGCTCCATGGCCAGATGACGGAAACAGGGTCTGACCAAGTGCCAGGA 
AGACCTGTGCTATAAACCACCCTGCCTGATCCTGCCCCTGCCTGACCCCGCCACGCCCTGCC 
GTCCAGCATGATTAAAGAATGCTGTCTCCTCTTGGAAAAAAAAAAAAAAAA 
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FIGURE 30 

MVTAALGPWAALLLFLLMCEIRMVELTFDRAVASGCQRCCDSEDPLDPAHVSSASSSGRPH 
ALPEIRPYINITILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSKGDKGEMGSPGAPCQKRF 
FAFSVGRKTALHSGEDFQTLLFERVFVNLDGCFDl^TGQFAAPLRGIYFFSLNVHSWNYKET 
YVHIMHNQKEAVILYAQPSERSIMQSQSVMLDLAYGDRVWVRLFKRQRENAIYSNDFDT^ 
FSGHLI KAEDD 



Important features: 
Signal peptide : 

amino acids 1-20 ' 



N-glycosylation site. 

amino acids 72-75 

Clq domain proteins* 

amino acids 144-178, 78-111 and 84-117 
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FIGURE 31 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTG 

AGGCCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTG 

CTCCAGGGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGG 

GCTGCCCATCCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGG 

ACCACGGTGCCCCGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCAT 

CACCATGCTCGACGCAGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCAC 

AGAACCAGATCGCCAGCCTGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTG 

GACCTGACGGCCAACAGGCTGCATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCT 

CGAGCGCCTCTACCTGGGCAAGAACCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGC 

TCGACCGCCTCCTGGAGCTCAAGCTGCAGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGC 

CTGCCCCGCCTGCTGCTGCTGGACCTCAGCCACAACAGCCTCCTGGCCCTGGAGCCCGGCAT 

CCTGGACACTGCCAACGTGGAGGCGCTGCGGCTGGCTGGTCTGGGGCTGCAGCAGCTGGACG 

AGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACCTGGATGTGTCCGACAACCAGCTGGAG 

CGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACGCGCCTGCGGCTGGCCGGCAACAC 

CCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGCTGCCCTGCAGGAGCTGGATG 

TGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCCTCTTCCCCCGCCTGCGG 

CTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGCTGGTTTGGCCCCTG 

GGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCACTTCCCGCCCA 

AGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAGCCACCACC 

ACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCTTCTAG 

CTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTCCA 

CTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTC 

AATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTT 

CACGGGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCA 

CGCCGAGGCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGC 

GTGGGGCTGCAGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTA 

TCGCAACCTATCGGGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTG 

AGTACACGGTCACCCAGCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGG 

CCCGGGCGGGTGCCGGAGGGCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCA 

CTCCAACCACGCCCCAGTCACCCAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCG 

CCCTGGCCGCGGTGCTCCTGGCCGCGCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGG 

GGGCGGGCCATGGCAGCAGCGGCTCAGGACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCT 

GGAACTGGAGGGAGTGAAGGTCCCCTTGGAGCCAGGCCCGAAGGCAACAGAGGGCGGTGGAG 

AGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCCACTCATGGGCTTCCCAGGGCCTGGCCTC 

CAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCAGAGAGAGACAGGGCAGCTGGGGCCG 

GGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCCACACCACGTAAGTTCTCAGTCC 

CAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCTGGGCCCTGTTCCCTCTGGA 

CCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCCCTAACGTCCCCAGAAC 

CGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTCCCTGGGCACGGCG 

GGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCACTCCAGGCGGA 

CCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGCGGCTGTG 

TGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGCTTTA 

GGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCTG 

GGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATG 

AAGGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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FIGURE 32 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFEN 

GITMLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLAN^ 

RLERLYLGKNRIRHIQPGAFDTLDRLLELKLQ 

GILDTANVEALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAG 

NTRIAQLRPEDLAGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLS 

PWVRESHVTIASPEETRCHFPPKNAGRLLLELDYADFGCPATTTTAWPTTRPVVREPTA^ 

SSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPE 

GFTGLYCESQMGQGTRPSPTPVTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRL 

TYRNLSGPDKRLVTLRLPASLAEYTVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPA 

VH SNHAP VTQ AREGNLPLL I AP ALAAVLLAALAAVG AA YCWRGRAMAAAAQDKGQ VG PG AG 

PLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPGPGLQSPLHAKPYI 
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FIGURE 33 

GAATCATCCACGCACCTGC^GCTCTGCTGAGAGAGTGCAAGCCGTGGGGGTTTTGAGCTCAT 
CTTCATGATTCATATGAGGAAATAAGTGGTAAAATCCTTGGAAATACAATGAGACTCATCAG 
AAACATTTACATATTTTGTAGTATTGTTATGACAGCAGAGGGTGATGCTCCAGAGCTGCCAG 
AAGAAAGGGAACTGATGACCAACTGCTCCAACATGTCTCTAAGAAAGGTTCCCGCAGACTTG 
ACCCCAGCCACAACGACACTGGATTTATCCTATAACCTCCTTTTTCAACTCCAGAGTTCAGA 
TTTTCATTCTGTCTCCAAACTGAGAGTTTTGATTCTATGCCATAACAGAATTCAACAGCTGG 
ATCTCAAAACCTTTGAATTCAACAAGGAGTTAAGATATTTAGATTTGTCTAATAACAGACTG . 
AAGAGTGTAACTTGGTATTTACTGGCAGGTCTCAGGTATTTAGATCTTTCTTTTAATGACTT 
TGACACCATGCCTATCTGTGAGGAAGCTGGCAACATGTCACACCTGGAAATCCTAGGTTTGA 
GTGGGGCAAAAATACT^AAATCAGATTTCCAGAAAATTGCTCATCTGCATCTAAATACTGTC 
TTCTTAGGATTCAGAACTCTTCCTCATTATGAAGAAGGTAGCCTGCCCATCTTAAACACAAC 
AAAACTGCACATTGTTTTACCAATGGACACAAATTTCTGGGTTCTTTTGCGTGATGGAATCA 
AGACTTGAAAAATATTAGAAATGACAAATATAGATGGCAAAAGCCAATTTGTAAGTTATGAA 
ATGCAACGAAATCTTAGTTTAGAAAATGCTAAGACATCGGTTCTATTGCTTAATAAAGTTGA 
TTTACTCTGGGACGACCTTTTCCTTATCTTACAATTTGTTTGGCATACATCAGTGGAACACT 
TTCAGATCCGAAATGTGACTTTTGGTGGTAAGGCTTATCTTGACCACAATTCATTTGACTAC 
TCAAATACTGTAATGAGAACTATAAAATTGGAGCATGTACATTTCAGAGTGTTTTACATTCA 
ACAGGATAAAATCTATTTGCTTTTGACCAAAATGGACATAGAAAACCTGACAATATCAAATG 
CACAAATGCCACACATGCTTTTCCCGAATTATCCTACGAAATTCCAATATTTAAATTTTGCC 
AATAATATCTTAACAGACGAGTTGTTTAAAAGAACTATCCAACTGCCTCACTTGAAAACTCT 
CATTTTGAATGGCAATAAACTGGAGACACTTTCTTTAGTAAGTTGCTTTGCTAACAACACAC 
CCTTGGAACACTTGGATCTGAGTCAAAATCTATTACAACATAAAAATGATGAAAATTGCTCA 
TGGCCAGAAACTGTGGTCAATATGAATCTGTCATACAATAAATTGTCTGATTCTGTCTTCAG 
GTGCTTGCCCAAAAGTATTCAAATACTTGACCTAAATAATAACCAAATCCAAACTGTACCTA 
AAGAGACTATTCATCTGATGGCCTTACGAGAACTAAATATTGCATTTAATTTTCTAACTGAT 
CTCCCTGGATGCAGTCATTTCAGTAGACTTTCAGTTCTGAACATTGAT^ATGAACTTCATTCT 
CAGCCCATCTCTGGATTTTGTTCAGAGCTGCCAGGAAGTTAAAACTCTAAATGCGGGAAGAA 
ATCCATTCCGGTGTACCTGTGAATTAAAAAATTTCATTCAGCTTGAAACATATTCAGAGGTC 
ATGATGGTTGGATGGTCAGATTCATACACCTGTGAATACCCTTTAAACCTAAGGGGAACTAG 
GTTAAAAGACGTTCATCTCCACGAATTATCTTGCAACACAGCTCTGTTGATTGTCACCATTG 
TGGTTATTATGCTAGTTCTGGGGTTGGCTGTGGCCTTCTGCTGTCTCCACTTTGATCTGCCC 
TGGTATCTCAGGATGCTAGGTCAATGCACACAAACATGGCACAGGGTTAGGAAAACAACCCA 
AGAACAACTCAAGAGAAATGTCCGATTCCACGCATTTATTTCATACAGTGAACATGATTCTC 
TGTGGGTGAAGAATGAATTGATCCCCAATCTAGAGAAGGAAGATGGTTCTATCTTGATTTGC 
CTTTATGAAAGCTACTTTGACCCTGGCAAAAGCATTAGTGAAAATATTGTAAGCTTCATTGA 
GAAAAGCTATAAGTCCATCTTTGTTTTGTCTCCCAACTTTGTCCAGAATGAGTGGTGCCATT 
ATGAATTCTACTTTGCCCACCACAATCTCTTCCATGAAAATTCTGATCATATAATTCTTATC 
TTACTGGAACCCATTCCATTCTATTGCATTCCCACCAGGTATCATAAACTGAAAGCTCTCCT 
GGAAAAAAAAGCATACTTGGAATGGCCCAAGGATAGGCGTAAATGTGGGCTTTTCTGGGCAA 
ACCTTCGAGCTGCTATTAATGTTAATGTATTAGCCACCAGAGAAATGTATGAACTGCAGACA 
TTCACAGAGTTAAATGAAGAGTCTCGAGGTTCTACAATCTCTCTGATGAGAACAGATTGTCT 
ATAAAATCCCACAGTCCTTGGGAAGTTGGGGACCACATACACTGTTGGGATGTACATTGATA 
CAACCTTTATGATGGCAATTTGACAATATTTATTAAAATAAAAAATGGTTATTCCCTTCATA 
TCAGTTTCTAGAAGGATTTCTAAGAATGTATCCTATAGAAACACCTTCACAAGTTTATAAGG 
GCTTATGGAAAAAGGTGTTCATCCCAGGATTGTTTATAATCATGAAAAATGTGGCCAGGTGC 
AGTGGCTCACTCTTGTAATCCCAGCACTATGGGAGGCCAAGGTGGGTGACCCACGAGGTCAA 
GAGATGGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA 
GCTGGGCGTGATGGTGCACGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCG 
CTTGAACCCGGGAGGTGGCAGTTGCAGTGAGCTGAGATCGAGCCACTGCACTCCAGCCTGGT 
GACAGAGCGAGACTCCATCTCAAAAAAAAGAAAAAAAAAAAAGAAAAAAATGGAAAACATCC 
TCATGGCCACAAAATAAGGTCTAATTCAATAAATTATAGTACATTAATGTAATATAATATTA 
CATGCCACTAAAAAGAATAAGGTAGCTGTATATTTCCTGGTATGGAAAAAACATATTAATAT 
GTTATAAACTATTAGGTTGGTGCAAAACTAATTGTGGTTTTTGCCATTG71AATGGCATTGAA 
ATAAAAGTGTAAAGAAATCTATACCAGATGTAGTAACAGTGGTTTGGGTCTGGGAGGTTGGA 
TTACAGGGAGCATTTGATTTCTATGTTGTGTATTTCTATAATGTTTGAATTGTTTAGAATGA 
ATCTGTATTTCTTTTATAAGTAGAAAAAAAATAAAGATAGTTTTTACAGCCT 
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MRLIRNIYIFCSIVMTAEGDAPELPEERELMTNCSNMSLRKVPADLTPATTTLDLSYNLLFQ 

LQS SDFHS VS KLRVL I LCHNR I QQLDLKT F E FNKELRYLD LSNNRL KS VT WYLLAGLR YLDL 

SFNDFDTMPICEEAGNMSHLEILGLSGAKIQKSDFQKIAHLHLNTVFLGFRTLPHYEEGSLP 

ILNTTKLHIVLPMDTNFWVLLRDGIKTSKILEMTNIDGKSQFVSYEMQRNLSL^ 

IjNKVDLLWDDLFLILQFVWHTSVEHFQIRNVTFGGKAYLDHNSFDYSNTVMRTI 

VFYIQQDKIYLLLTKMDIENLTISNAQMPHMLFPNYPTKFQYLNFANNILTDELFKRTIQLP 

HLKTLILNGNKLETLSLVSCFAmTPLEHLD^ 

DSVFRCLPKSIQILDLNNNQIQTVPKETIHLMALRELNIAFNFLTDLPGCSHFSRLSVLNIE 
MNFILSPSLDFVQSCQEVKTLNAGRNPFRCTCELKNFIQLETYSEVMMVGWSDSYTCEYPLN 
LRGTRLKDVHLHELSCNTALLIWIWIMLVL^ 

RKTTQEQLKRNVRFHAFI SYSEHDSLWVKNEL I PNLEKEDGS I LI CLYES YFDPGKS I SENI 
VSFIEKSYKSIFVLSPNFVQNEWCHYEFYFAHHNLFHENSDHIILILLEPIPFYCIPTRYHK 
LKALLEKKAYLEWPKDRRKCGLF>7ANLRAAINVNVLATREMYELQTFTELNEESRGSTISLM 

RTDCL 
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FIGURE 35 A 

GGGGGCTTTCTTGGGCTTGGCTGCTTGGAACACCTGCCTCCAAGGACCGGCCTCGGAGGGGT 
CGCCGGGAAAGGGAGGGAAGAAGGAAGGGCGGGGCCGGCCCCCCTGCGCCCGCCCCGCGCCT 
CTGCGCGCCCCTGTCCGCCCCGGCCCAGCCCAGCCCAGCCCCGCGGGCCGGTCACACGCGCA 
GCCAGCCGGCCGCCTCCCGCGCCCAAGCGCGCCGCTCTGCTGTGCCCTGCGCCCTTGCCCCG 
CGCCAGCTTCTGCGCCCGCAGCCCGCCCGGCGCCCCCGGTGACCGTGACCCTGCCCTGGGCG 
CGGGGCGGAGCAGGCATGTCCCGCCCGGGGACCGCTACCCCAGCGCTGGCCCTGGTGCTCCT 
GGCAGTGACCCTGGCCGGGGTCGGAGCCCAGGGCGCAGCCCTCGAGGACCCTGATTATTACG 
GGCAGGAGATCTGGAGCCGGGAGCCCTACTACGCGCGCCCGGAGCCCGAGCTCGAGACCTTC 
TCTCCGCCGCTGCCTGCGGGGCCCGGGGAGGAGTGGGAGCGGCGCCCGCAGGAGCCCAGGCC 
GCCCAAGAGGGCCACCAAGCCCAAGAAAGCTCCCAAGAGGGAGAAGTCGGCTCCGGAGCCGC 
CTCCACCAGGTAAACACAGCAACAAAAAAGTTATGAGAACCAAGAGCTCTGAGAAGGCTGCC 
AACGATGATCACAGTGTCCGTGTGGCCCGTGAAGATGTCAGAGAGAGTTGCCCACCTCTTGG 
TCTGGAAACCTTAAAAATCACAGACTTCCAGCTCCATGCCTCCACGGTGAAGCGCTATGGCC 
TGGGGGCACATCGAGGGAGACTCAACATCCAGGCGGGCATTAATGAAAATGATTTTTATGAC 
GGAGCGTGGTGCGCGGGAAGAAATGACCTCCAGCAGTGGATTGAAGTGGATGCTCGGCGCCT 
GACCAGATTCACTGGTGTCATCACTCAAGGGAGGAACTCCCTCTGGCTGAGTGACTGGGTGA 
CATCCTATAAGGTCATGGTGAGCAATGACAGCCACACGTGGGTCACTGTTAAGAATGGATCT 
GGAGACATGATATTTGAGGGAAACAGTGAGAAGGAGATCCCTGTTCTCAATGAGCTACCCGT 
CCCCATGGTGGCCCGCTACATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCT 
GCATGAGAATGGAGATCCTGGGCTGCCCACTGCCAGATCCTAATAATTATTATCACCGCCGG 

aacgagatgaccaccactgatgacctggattttaagcaccacaattataaggaaatgcgcca 

gttgatgaaagttgtgaatgaaatgtgtcccaatatcaccagaatttacaacattggaaaaa 

gccaccagggcctgaagctgtatgctgtggagatctcagatcaccctggggagcatgaagtc 

ggtgagcccgagttccactacatcgcgggggcccacggcaatgaggtgctgggccgggagct 

gctgctgctgctggtgcagttcgtgtgtcaggagtacttggcccggaatgcgcgcatcgtcc 

acctggtggaggagacgcggattcacgtcctcccctccctcaaccccgatggctacgagaag 

gcctacgaagggggctcggagctgggaggctggtccctgggacgctggacccacgatggaat 

tgacatcaacaacaactttcctgatttaaacacgctgctctgggaggcagaggatcgacaga 

atgtccccaggaaagttcccaatcactatattgcaatccctgagtggtttctgtcggaaaat 

gccacggtggctgccgagaccagagcagtcatagcctggatggaaaaaatcccttttgtgct 

gggcggcaacctgcagggcggcgagctggtggtggcgtatccctacgacctggtgcggtccc 

cctggaagacgcaggaacacacgcccacccccgatgaccacgtgttccgctggctggcctac 

tcctatgcctccacacaccgcctcatgAcagacgcccggaggagggtgtgccacacggagga 

cttccagaaggaggagggcactgtcaatggggcctcctggcacaccgtcgctggaagtctga 

acgatttcagctaccttcatacaaactgcttcgaactgtccatctacgtgggctgtgataaa 

tacccacatgagagccagctgcccgaggagtgggagaataaccgggaatctctgatcgtgtt 

catggagcaggttcatcgtggcattaaaggcttggtgagagattcacatggaaaaggaatcc 

caaacgccattatctccgtagaaggcattaaccatgacatccgaacagccaacgatggggat 

tactggcgcctcctgaaccctggagagtatgtggtcacagcaaaggccgaaggtttcactgc 

atccaccaagaactgtatggttggctatgacatgggggccacaaggtgtgacttcacactta 

gcaaaaccaacatggccaggatccgagagatcatggagaagtttgggaagcagcccgtcagc 

ctgccagccaggcggctgaagctgcgggggcggaagagacgacagcgtgggtgaccctcctg 

ggcccttgagactcgtctgggacccatgcaaattaaaccaacctggtagtagctccatagtg 

gactcactcactgttgtttcctctgtaattcaagaagtgcctggaagagagggtgcattgtg 

AGGCAGGTCCCAAAAGGGAAGGCTGGAGGCTGAGGCTGTTTTCTTTTCTTTGTTCCCATTTA 
TCCAAATAACTTGGACAGAGCAGCAGAGAAAAGCTGATGGGAGTGAGAGAACTCAGCAAGCC 
AACCTGGGAATCAGAGAGAGAAGGAGAAGGAGGGGAGCCTGTCCGTTCAGAGCCTCTGGCTGC 
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FIGURE 3SB 

ATAGAAAAGGATTCTGGTGCTTCCCCTGTTTGCGTGGCAGCAAGGGTTCCACGTGCATTTGC 

aatttgcacagctaaaattgcagcatttcccc^gctgggctgtcccaaatgttaccatSga 

GATGCTCCCAGGCGTCCTAAGAGAATCCACCCTCTCTGGCCCTGGGACATTGCAAGCTGCTA 

CAAATAAATTCTGTGTTCTTTTGACAATAGCGTCATTGCCAAGTGCACATCAGTGAGCCTCT 

TGAATCTGTTTAGTCTCCTTTTTCAACAAAGGAGTGTGTTC^GAAAAGGAGAGAGAGGCT^^ 

GATCATTCAGGAGTTTGTTGGGCAGCAAGCATGGAGCTTCTTGCACAAATTCTGGGTCCATA 

^^^^^?^^^^^^^^^^^^®^^CTGGAGGTTCCCCAGGTAGGGAGAGCCA 

GAGGTGCCAGCCTTCCTGAAGGGCCAGAAAATTTAGCCTGGATCTCCTCTTTTACCTGCTAG 

GACTGGAAAGAGCCAGAAGTGGGGTGGCCTGAAGCCCTCTCTCTGCTTGAGGTATTGCCCCT 

GTGTGGAATTGAGTGCTCATGGGTTGGCCTCATATCAGCCTGGGAGTTATTTTTGATATGTA 

GAATGCCAGATCTTCCAGATTAGGCTAAATGTAATGAAAACCTCTTAGGATTATCTGTGGAG 

CATCAGTTTGGGAAGAATTATTGAATTATCTTGCAAGAAAAAAGTATGTCTCACTTTTTGTT 

AATGTTGCTGCCTCATTGACCTGGGAAAAATGAAAAAAAAAAATAAAGCAAATGGTAAGACC 
CTTAAAAAAA ^AAAA^ AAfaACC 
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FIGURE 36 

MSRPGTATPALALVLLAVTLAOT^ 

AGPGEEV^RRPQEPRPPKRATKPKKAPKREKSAPEPPPPGKHSNKKVMRTKSSEKAANDDHS 
VRVAREDVRESCPPLGLETLKITDFQLHASWKRYGLGAHRGRLNIQAGINENDFYDGAW^ 
GRNDLQQW I E VDARRLTRFTGV I TQGRNSLWLSD WVTS YKVMVSNDSHTWVTVKNGSGDM I F 
EGNSEKE I PVLNELPVPMVARYIRINPQSWFDNGS I CMRME I LGCPLPDPNNYYHRRNEMTT 
TDDLDFKHHNYKEMRQLMKVVNEMCPNITRIYNIGKSHQGLKLYAVEISDHPGEHEVGEPEF 
HY I AGAHGNE VLGRELLLLL VQ FVCQ E YLARNAR I VHL VE ETR I HVLP S LNPDG YE KA YEGG 
SELGGWSLGRWTHDG I D I NNNFPDLNTLLWEAEDRQNV PRKVPNHY I A I PEWFLSENATVAA 
ETRAVIAWMEKIPFVLGGNLQGGELWAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYAST 
HRLMTDARRRVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHES 
QLPEEWENNRESLIVFMEQVHRGIKGLVRDSHGKGI PNAI ISVEGINHDIRTANDGDYWRLL 
NPGEYVVTAKAEGFTASTKNQWGYDMGATRCD^ 
LKLRGRKRRQRG 
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FIGURE 37 

CTAAGAGGACAAGATGAGGCCCGGCCTCTCATTTCTCCTAGCCCTTCTGTTCTTCCTTGGCC 

AAGCTGCAGGGGATTTGGGGGATGTGGGACCTCCAATTCCCAGCCCCGGCTTCAGCTCTTTC 

CCAGGTGTTGACTCCAGCTCCAGCTTCAGCTCCAGCTCCAGGTCGGGCTCCAGCTCCAGCCG 

CAGCTTAGGCAGCGGAGGTTCTGTGTCCCAGTTGTTTTCCAATTTCACCGGCTCCGTGGATG 

ACCGTGGGACCTGCCAGTGCTCTGTTTCCCTGCCAGACACCACCTTTCCCGTGGACAGAGTG 

GAACGCTTGGAATTCACAGCTCATGTTCTTTCTCAGAAGTTTGAGAAAGAACTTTCTAAAGT 

GAGGGAATATGTCCAATTAATTAGTGTGTATGAAAAGAAACTGTTAAACCTAACTGTCCGAA 

TTGACATCATGGAGAAGGATACCATTTCTTACACTGAACTGGACTTCGAGCTGATCAAGGTA 

GAAGTGAAGGAGATGGAAAAACTGGTCATACAGCTGAAGGAGAGTTTTGGTGGAAGCTCAGA 

AATTGTTGACCAGCTGGAGGTGGAGATAAGAAATATGACTCTCTTGGTAGAGAAGCTTGAGA 

CACTAGACAAAAACAATGTCCTTGCCATTCGCCGAGAAATCGTGGCTCTGAAGACCAAGCTG 

AAAGAGTGTGAGGCCTCTAAAGATCAAAACACCCCTGTCGTCCACCCTCCTCCCACTCCAGG 

GAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGTGGTTCAGCTCAACTGGA 

GAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTCCCCAGCATCCAAACAAA 

GGACTGTATTGGGTGGCGCCATTGAATACAGATGGGAGACTGTTGGAGTATTATAGACTGTA 

CAACACACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTTGCGGATCACCTATGGCC 

AAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACATGTACAACACCGGGAAT 

ATTGCCAGAGTTAACCTGACCACCAACACGATTGCTGTGACTCAAACTCTCCCTAATGCTGC 

CTATAATAACCGCTTTTCATATGCTAATGTTGCTTGGCAAGATATTGACTTTGCTGTGGATG 

AGAATGGATTGTGGGTTATTTATTCAACTGAAGCCAGCACTGGTAACATGGTGATTAGTAAA 

CTCAATGACACCACACTTCAGGTGCTAAACACTTGGTATACCAAGCAGTATAAACCATCTGC 

TTCTAACGCCTTCATGGTATGTGGGGTTCTGTATGCCACCCGTACTATGAACACCAGAACAG 

AAGAGATTTTTTACTATTATGACACAAACACAGGGAAAGAGGGCAAACTAGACATTGTAATG 

CATAAGATGCAGGAAAAAGTGCAGAGCATTAACTATAACCCTTTTGACCAGAAACTTTATGT 

CTATAACGATGGTTACCTTCTGAATTATGATCTTTCTGTCTTGCAGAAGCCCCAGTAAGCTG 

TTTAGGAGTTAGGGTGAAAGAGAAAATGTTTGTTGAAAAAATAGTCTTCTCCACTTACTTAG 

ATATCTGCAGGGGTGTCTAAAAGTGTGTTCATTTTGCAGCAATGTTTAGGTGCATAGTTCTA 

CCACACTAGAGATCTAGGACATTTGTCTTGATTTGGTGAGTTCTCTTGGGAATCATCTGCCT 

CTTCAGGCGCATTTTGCAATAAAGTCTGTCTAGGGTGGGATTGTCAGAGGTCTAGGGGCACT 

GTGGGCCTAGTGAAGCCTACTGTGAGGAGGCTTCACTAGAAGCCTTAAATTAGGAATTAAGG 

AACTTAAAACTCAGTATGGCGTCTAGGGATTCTTTGTACAGGAAATATTGCCCAATGACTAG 

TCCTCATCCATGTAGCACCACTAATTCTTCCATGCCTGGAAGAAACCTGGGGACTTAGTTAG 

GTAGATTAATATCTGGAGCTCCTCGAGGGACCAAATCTCCAACTTTTTTTTCCCCTCACTAG 

CACCTGGAATGATGCTTTGTATGTGGCAGATAAGTAAATTTGGCATGCTTATATATTCTACA 

TCTGTAAAGTGCTGAGTTTTATGGAGAGAGGCCTTTTTATGCATTAAATTGTACATGGCAAA 

TAAATCCCAGAAGGATCTGTAGATGAGGCACCTGCTTTTTCTTTTCTCTCATTGTCCACCTT 

ACTAAAAGTCAGTAGAATCTTCTACCTCATAACTTCCTTCCAAAGGCAGCTCAGAAGATTAG 

AACCAGACTTACTAACCAATTCCACCCCCCACCAACCCCCTTCTACTGCCTACTTTAAAAAA 

ATTAATAGTTTTCTATGGAACTGATCTAAGATTAGAAAAATTAATTTTCTTTAATTTCATTA 

TGGACTTTTATTTACATGACTCTAAGACTATAAGAAAATCTGATGGCAGTGACAAAGTGCTA 

GCATTTATTGTTATCTAATAAAGACCTTGGAGCATATGTGCAACTTATGAGTGTATCAGTTG 

TTGCATGTAATTTTTGCCTTTGTTTAAGCCTGGAACTTGTAAGAAAATGAAAATTTAATTTT 

TTTTTCTAGGACGAGCTATAGAAAAGCTATTGAGAGTATCTAGTTAATCAGTGCAGTAGTTG 

GAAACCTTGCTGGTGTATGTGATGTGCTTCTGTGCTTTTGAATGACTTTATCATCTAGTCTT 

TGTCTATTTTTCCTTTGATGTTCAAGTCCTAGTCTATAGGATTGGCAGTTTAAATGCTTTAC 

TCCCCCTTTTAAAATAAATGATTAAAATGTGCTTTGAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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MRPGLSFLLALLFFLGQAAGDLGDVGPPIPSPGFSSFPGVDSSSSFSSSSRSGSSSSRSLGS 
GGSVSQLFSNFTGSVDDRGTCQCSVSLPDTTFPVDRVERLEFTAHVLSQKFEKELSKVREYV 
QL I SVYEKKLLNLTVR ID IMEKDTI S YTELDFEL I KVE VKEMEKLVIQLKES FGGSSE I VDQ 
LEVE IRNMTLLVEKLETLDKNNVLAI RRE I VALKTKLKECEASKDQNTPWHPPPTPGSCGH 
GGVVNISKPSWQLNWRGFSYLYGAWGRDYSPQHPNKGLYWVAPLNTDGRLLEYYRLYNTLD 
DLLL Y I NARE LR I TYGQGSGTAVYNNNM YVNM YNTGN I ARVNLTTNT I AVTQTLPNAAYNNR 
FSYANVAWQDIDFAVDENGLWIYSTEASTGI^^ISKLNDTTLQVLNTWYTKQYKPSASNAF 

MVCGVLYATRTMNTRTEEIFYYYDTNTGKEGKLDIVMHKMQEKVQSINYNPFDQKLYVYNDG 
YLLNYDLSVLQKPQ 
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FIGURE 39 

GCTCTGAAGACCAAGCTGAAAGAGTGTGAGGCCTCTAAAGATCAAACACCCCTGTCGTCCAC 
CCTCCTCCC^CTCCAGGGAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGT 
GGTTCAGCTCAACTGGAGAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTC 
CCGAGCATCCAAACAAAGGNATGTATTGGGNGGCGCCATTGAATACAGATGGGAGACTGTTG 
GAGTATTATAGACTGTACAACCCACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTT 
GCGGATCACCTATGGCCAAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACA 
TGTACAACACCGGGNATATTGCCAGAGTTAACCTGACC 
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FIGURE 40 

TCTCGCAGATAGTAAATAATCTCGGAAAGGCGAGAAAGAAGCTGTCTCCATCTTGTCTGTAT 

oH^^S^^^^^^^^^^^^^^^GGGAGCGTCCTGGGGCTGTGCTCCATGGCGAGCT 
GGATACCATGTTTGTGTGGAAGTGCCCCGTGTTTGCTATGCC(^TC 
AACTCCACTGTAACTAGATTGATCTATGCACTTTTCTTGCTTGTTG^ 
TGTAATGTTGATACCAGGAATGGAAGAACAACTGA^^ 

AGAAACMTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAM 

TCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCTTTAAATTTGCTGCAGCAATTGC^TTA 

TTATTGOOGCATTCTTCaTTCCAGAAGGAAC^ 

GCAGGTGCCTTTTGTTTC^TCCTCATACAACTAGT 

GAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGAACTCGAGATGTTGGTATG^AG 
TATCAGCTACAGCTCTGAATTATCTGCTGTCTTTAGTTGCTATC^ 

TAC^CTCATCCAGCCAGTTGTTCAGAAAACAAGGCGTTCATCAGTGTC^^ 
CGTTGGTGCTTCTGTAATGTCTATACTGC 

GAACCAGAAACAAATTGCAACCCAAGTCTACTAAGCATAATTGGCTACAATACAAC^ 

TCTTTTTGTTGTGTGTATTTTATTCCAGCATCCGTACTTCA^ 

CTGACTCTAACAAGTGATGAATCTACATTAATAGAAGATGGTGGAGCTAGAAGT 

ACTGGAGGATGGGGACGATGTTCACCGAGCTGTAGATAATGAAAGGGATGCT^ 

GTTATTCCTTCTTTCACTTCATGCTTTTCCTGGCTTCACT 

AACTGGTCCAGGTATGAACCCTCTCGTGAGATGAAAAGTCAGTGGACAGCTCT 

AATCTCTTCCAGTTGGATTGGCATCGTGCTGTATGTTTGGACACTCGTG^G^CACCA^ 

H ACAAA T CGTGATTTTGAC ^G™^ 
JTATTTGAAAACMTATTCOCAACm^ 

J1^ CC ^ GTGTTCTGG CATGAATTAGATTTTACTGCTTGTCATTTTCT 
GTGCATTGATATGTGAAGTAGAATGAATTC^ 

TAGTAAAAGTGGCCATTATTGGGCTTATTCTCTGCTCTATAGTTGTGAAATGAAGACT 

GCAAATGTATGGCTGCCTTTTGAAATATTTGATGTGTTGCCTGGCAGGATACTGCAAArAAr 

A ^^ TA J" TTTAAAATTTATAAACAAGTCACTT ^ 
AGGTTTTACCCTTGATACGGAATTTACAC^^ 

TGGATGGAGGTGTCGGTACTAAATTGAATAACGAGTAAATAATiCT^^^roG^^^A^ATOr 
CCTTTGCCAACAAAGTGAACTGTTTTGGTTCTm 

AATGTTTGGAACTCTGAAGGATTTAGACAAGGTTTT^ 

aagtgttttgaaagtcacttt^ 

AATCCCAGCACTTTGGGAGCTTAAGTGGGTAGATTACTTGAGCCCAGGAATra^ 
TGGCACATGGTGAACCTGTTCTATAAAAATAATCTGGCTTTC^ 

cactgagaggctagtgaagattgctgagcccagagcca^ 

CACTGCACTCTAGCTGGCACAGAGTAAGCCAAAAAAATATATATAT^ 
CAAAATTTTGACAGGGAAGGAAGTAACTGCAAAACCACTAGGCTTT^ 

aaatctagtccagttctctcatttaaaaaaatgaagacactgaaatS 

CAGATAGCTAATTAGGAAATTTCAAGTTGGCCAATAATAGCATTCTCTCTGACAT^A^AA 

^-^^^'^'^^-^'^'G! catattgatt ta<^^ctcStactgtgata^ttaatgtgat 

ATAAACTCCTGCTTATAGTATACTACACAGTTCAAAAGATGTTTAAAATGCTTTTGTATTTA 
S C S3 AATTGAAATATATAGATTATTGTAA ^ 
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FIGURE 41 

MGSVLGLCSMASWIPCLCGSAPCLLCRCCPSGNNSTWRLIYALFLLVGVCVACVMLIPGME 
EQLNKI PGFCENEKGWPCNILVG YKAVYRLCFGLAMF YLLLSLLM I KVKSSSDPRAAVHNG 
FWFFKFAAAI AI I I GAFF I PEGTFTTVWFYVGMAGAFCF I LI QLVLLI DFAHSWNESWVEKM 

EEGNSRCIWYAALLSATALNYLLSLVAIVLFFVYYTHPASCSENKAFISVNI^LCVGASVMSI 
LPKIQESQPRSGLLQSSVITVYTMYLTWSAMTNEPETNCNPSLLSIIGYNTTSTVPKEGQSV 
QWWHAQGI I GLI LFLLCVFYSS IRTSNNSQVNKLTLTSDESTLIEDGGARSDGSLEDGDDVH 

RAVDNERDGVTYSYSFFHFMLFLASLYIMMTLTNWSRYEPSREMKSQWTAVWVKISSSWIGI 
VLYVWTLVAPLVLTNRDFD 
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GCGAGAAAGAAGCTGTCTCCATCTTGTCTGTATCCCGCTGCTTCTTGNGACGTTGTGGAGAT 
GGGGAGCGTCCCTGGGGCTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCC 
CCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAACAANTCCACTGTAACTAGATTGATCTA 
TGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAG 
AACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATT 
TTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCT 
CTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGAT 
TTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 43 

GTTATTGTGAACTTTGTGGAGATGGGAGGTCNTGGGGCTGTGTTCCATGGCGAGCTGGATAC 
CANGTTTGTGTGGAAGTGCCCCGTGTTTGNTATGCCGATGCTGTCCTAGTGGAAACAANTCC 
ACTGTAATTAGATTGATNTATGCACTTTTNTTGCTTGTTGGAGTANGTGTAGCTTGTGTAAT 
GTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAG 
GTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATNGTTTGTGCTTTGGTTTG 
GCTANGTTCTATNTTCTTCTCTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAG 
AGCTGCAGTGCACAATGGATTTTGGTTTTTTAAATTTGCTGCAGCAATTGCAATTATTATTG 
GGGC 
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FIGURE 44 

AAGAAGCTGTCTCCATCTTGTCTGTATCCGCTGCTCTTGTGAACGTTNTGGAGATGGGGAGC 
GTCCTTGGGGTTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCCCCGTGTT 
TGCTATGCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACTAGATTGATCTATGCACTT 
TTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAGAACAACT 
GAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATTTTGGTTG 
GCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTA 
CTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTT 
CTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 45 

GCTGTCCTTAGTGGAAACAANTCCAACTTGTAACTTGGATTGATCTATGCACTTTTTCCTTG 
CTTGTTGGAGTATGTGTAGCTTTGTGTAATGTTGTTCCCAGGATTGGANGAACAACTGAATA 
AGATTCCTGGATTTTTGTGAGAATGAGAAAGGTGTTGTCCCCTTGTAACATTTTTGGTTGGC 
TATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTACT 
AATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCT 
TTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGCATTCTTCATTCCAGAAGGAACTTTT 
ACAACTGTGTGGTTTTATGTAGGCATGGCAGGTGCCTTTTGTTTCATCCTCATACAACTAGT 
CTTACTTATTGATTTTGCACATTCATGGAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGA 
ACTCGAGATGTTGGTATGCAGCCTTGTTATCAGCTACAGCTCTGAATTATCTGCTGTCTTTA 
GTTGCTATCGTCCTGTTCTTTGTCTACTACACTCATCCAGCCAGTTGTTCAGAAAACAAGGC 
GTTCATCAGTGTCAACATGCTCCTCTGCGTTGGTGCTTCTGTAATG 
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FIGURE 46A 

CTCGGGCGCGCACAGGCAGCTCGGTTTGCCCTGCGATTGAGCTGCGGGTCGCGGCCGGCGCC 

GGCCTCTCCAATGGCAAATGTGTGTGGCTGGAGGCGAGCGCGAGGCTTTCGGCAAAGGCAGT 

CGAGTGTTTGCAGACCGGGGCGAGTCCTGTGAAAGCAGATAAAAGAAAACATTTATTAACGT 

GTCATTACGAGGGGAGCGCCCGGCCGGGGCTGTCGCACTCCCCGCGGAACATTTGGCTCCCT 

CCAGCTCCGAGAGAGGAGAAGAAGAAAGCGGAAAAGAGGCAGATTCACGTCGTTTCCAGCCA 

AGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTATTAGCGATGCCCC 

CTGGTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCTGGCTCGCTTCCCTCCCTCGT 

TTCCAGCTCCTGGGCGAATCCCACATCTGTTTCAACTCTCCGCCGAGGGCGAGCAGGAGCGA 

GAGTGTGTCGAATCTGCGAGTGAAGAGGGACGAGGGAAAAGAAACAAAGCCACAGACGCAAC 

TTGAGACTCCCGCATCCCAAAAGAAGCACCAGATCAGCAAAAAAAGAAGATGGGCCCCCCGA 

GCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCTGCTGGGTGGAAGCTCGGCCTTC 

CTGTCGCACCACCGCCTGAAAGGCAGGTTTCAGAGGGACCGCAGGAACATCCGCCCCAACAT 

CATCCTGGTGCTGACGGACGACCAGGATGTGGAGCTGGGTTCCATGCAGGTGATGAACAAGA 

CCCGGCGCATCATGGAGCAGGGCGGGGCGCACTTCATCAACGCCTTCGTGACCACACCCATG 

TGCTGCCCCTCACGCTCCTCCATCCTCACTGGCAAGTACGTCCACAACCACAACACCTACAC 

CAACAATGAGAACTGCTCCTCGCCCTCCTGGCAGGCACAGCACGAGAGCCGCACCTTTGCCG 

TGTACCTCAATAGCACTGGCTACCGGACAGCTTTCTTCGGGAAGTATCTTAATGAATACAAC 

GGCTCCTACGTGCCACCCGGCTGGAAGGAGTGGGTCGGACTCCTTAAAAACTCCCGCTTTTA 

TAACTACACGCTGTGTCGGAACGGGGTGAAAGAGAAGCACGGCTCCGACTACTCCAAGGATT 

ACCTCACAGACCTCATCACCAATGACAGCGTGAGCTTCTTCCGCACGTCCAAGAAGATGTAC 

CCGCACAGGCCAGTCCTCATGGTCATCAGCCATGCAGCCCCCCACGGCCCTGAGGATTCAGC 

CCCACAATATTCACGCCTCTTCCCAAACGCATCTCAGCACATCACGCCGAGCTACAACTACG 

CGCCCAACCCGGACAAACACTGGATCATGCGCTACACGGGGCCCATGAAGCCCATCCACATG 

GAATTCACCAACATGCTCCAGCGGAAGCGCTTGCAGACCCTCATGTCGGTGGACGACTCCAT 

GGAGACGATTTACAACATGCTGGTTGAGACGGGCGAGCTGGACAACACGTACATCGTATACA 

CCGCCGACCACGGTTACCACATCGGCCAGTTTGGCCTGGTGAAAGGGAAATCCATGCCATAT 

GAGTTTGACATCAGGGTCCCGTTCTACGTGAGGGGCCCCAACGTGGAAGCCGGCTGTCTGAA 

TCCCCACATCGTCCTCAACATTGACCTGGCCCCCACCATCCTGGACATTGCAGGCCTGGACA 

TACCTGCGGATATGGACGGGAAATCCATCCTCAAGCTGCTGGACACGGAGCGGCCGGTGAAT 

CGGTTTCACTTGAAAAAGAAGATGAGGGTCTGGCGGGACTCCTTCTTGGTGGAGAGAGGCAA 

GCTGCTACACAAGAGAGACAATGACAAGGTGGACGCCCAGGAGGAGAACTTTCTGCCCAAGT 

ACCAGCGTGTGAAGGACCTGTGTCAGCGTGCTGAGTACCAGACGGCGTGTGAGCAGCTGGGA 

CAGAAGTGGCAGTGTGTGGAGGACGCCACGGGGAAGCTGAAGCTGCATAAGTGCAAGGGCCC 

CATGCGGCTGGGCGGCAGCAGAGCCCTCTCCAACCTCGTGCCCAAGTACTACGGGCAGGGCA 

GCGAGGCCTGCACCTGTGACAGCGGGGACTACAAGCTCAGCCTGGCCGGACGCCGGAAAAAA 

CTCTTCAAGAAGAAGTACAAGGCCAGCTATGTCCGCAGTCGCTCCATCCGCTCAGTGGCCAT 

CGAGGTGGACGGCAGGGTGTACCACGTAGGCCTGGGTGATGCCGCCCAGCCCCGAAACCTCA 

CCAAGCGGCACTGGCCAGGGGCCCCTGAGGACCAAGATGACAAGGATGGTGGGGACTTCAGT 

GGCACTGGAGGCCTTCCCGACTACTCAGCCGCCAACCCCATTAAAGTGACACATCGGTGCTA 

CATCCTAGAGAACGACACAGTCCAGTGTGACCTGGACCTGTACAAGTCCCTGCAGGCCTGGA 

AAGACCACAAGCTGCACATCGACCACGAGATTGAAACCCTGCAGAACAAAATTAAGAACCTG 

AGGGAAGTCCGAGGTCACCTGAAGAAAAAGCGGCCAGAAGAATGTGACTGTCACAAAATCAG 

CTACCACACCCAGCACAAAGGCCGCCTCAAGCACAGAGGCTCCAGTCTGCATCCTTTCAGGA 

AGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTTGCGGGAGCAGAAGCGCAAGAAGAAACTC 

CGCAAGCTGCTCAAGCGCCTGCAGAACAACGACACGTGCAGCATGCCAGGCCTCACGTGCTT 

CACCCACGACAACCAGCACTGGCAGACGGCGCCTTTCTGGACACTGGGGCCTTTCTGTGCCT 

GCACCAGCGCCAACAATAACACGTACTGGTGCATGAGGACCATCAATGAGACTCACAATTTC 
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FIGURE 46B 

CTCTTCTGTGAATTTGCAACTGGCTTCCTAGAGTACTTTGATCTCAACACAGACCCCTACCA 

GCTGATGAATGCAGTGAACACACTGGACAGGGATGTCCTCAACCAGCTACACGTACAGCTCA 

TGGAGCTGAGGAGCTGCAAGGGTTACAAGCAGTGTAACCCCCGGACTCGAAACATGGACCTG 

GATGGAGGAAGCTATGAGCAATACAGGCAGTTTCAGCGTCGAAAGTGGCCAGAAATGAAGAG 

ACCTTCTTCCAAATCACTGGGACAACTGTGGGAAGGCTGGGAAGGTTAAGAAACAACAGAGG 

TGGACCTCCAAAAACATAGAGGCATCACCTGACTGCACAGGCAATGAAAAACCATGTGGGTG 

ATTTCCAGCAGACCTGTGCTATTGGCCAGGAGGCCTGAGAAAGCAAGCACGCACTCTCAGTC 

AACATGACAGATTCTGGAGGATAACCAGCAGGAGCAGAGATAACTTCAGGAAGTCCATTTTT 

GCCCCTGCTTTTGCTTTGGATTATACCTCACCAGCTGCACAAAATGCATTTTTTCGTATCAA 

AAAGTCACCACTAACCCTCCCCCAGAAGCTCACAAAGGAAAACGGAGAGAGCGAGCGAGAGA 

GATTTCCTTGGAAATTTCTCCCAAGGGCGAAAGTCATTGGAATTTTTAAATCATAGGGGAAA 

AGCAGTCCTGTTCTAAATCCTCTTATTCTTTTGGTTTGTCACAAAGAAGGAACTAAGAAGCA 

GGACAGAGGCAACGTGGAGAGGCTGAAAACAGTGCAGAGACGTTTGACAATGAGTCAGTAGC 

ACAAAAGAGATGACATTTACCTAGCACTATAAACCCTGGTTGCCTCTGAAGAAACTGCCTTC 

ATTGTATATATGTGACTATTTACATGTAATCAACATGGGAACTTTTAGGGGAACCTAATAAG 

AAATCCCAATTTTCAGGAGTGGTGGTGTCAATAAACGCTCTGTGGCCAGTGTAAAAGAAAAA 
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MGPPSLVLCLLSATVFSLLGGSSAFLSHHRLKGRFQRDRRNIRPNIILVLTDDQDVELGSMQ 
VMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGKYVHNHNTYTNNENCSSPSWQAQHES 
RTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGSD 
YSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHITP 
S YNYAPNPDKHW IMR YTG PMKP I HME FTNMLQRKRLQTLMS VDDSMET I YNMLVETGELDNT 

YIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIVLNIDLAPTILDI 
AGLD I PADMDGKS I LKLLDTERPVNRFHLKKKMRVWRDS FLVERGKLLHKRDNDKVDAQEEN 
FLPKYQRVKDLCQRAEYQTACEQLGQKWQCVEDATGKLKLHKCKGPMRLGGSRALSNLVPKY 
YGQGSEACTCDSGD YKLS LAGRRKKLFKKKYKAS YVRSRS I RS VAI EVDGRVYHVGLGDAAQ 
PRNLTKRHWPGAPEDQDDKDGGDFSGTGGLPDYSAANPIKVTHRCYILENDTVQCDLDLYKS 
LQAWKDHKLHIDHEIETLQNKIKNLREVRGHLKKKRPEECDCHKISYHTQHKGRLKHRGSSL 
HPFRKGLQEKDKVWLLREQKRKKKLRKLLKRLQNNDTCSMPGLTCFTHDNQHWQTAPFWTLG 
PFCACTSANNNTYWCMRTINETHNFLFCEFATGFLEYFDLNTDPYQLMNAVNTLDRDVLNQL 
HVQLMELRSCKGYKQCNPRTRNMDLDGGSYEQYRQFQRRKWPEMKRPSSKSLGQLWEGWEG 
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FIGURE 4S 

AACAAAGTTCAGTGACTGAGAGGGCTGAGCGGAGGCTGCTGAAGGGGAGAAAGGAGTGAGGA 

GCTGCTGGGCAGAGAGGGACTGTCCGGCTCCCAGATGCTGGGCCTCCTGGGGAGCACAGCCC 

TCGTGGGATGGATCACAGGTGCTGCTGTGGCGGTCCTGCTGCTGCTGCTGCTGCTGGCCACC 

TGCCTTTTCCACGGACGGCAGGACTGTGACGTGGAGAGGAACCGTACAGCTGCAGGGGGAAA 

CCGAGTCCGCCGGGCCCAGCCTTGGCCCTTCCGGCGGCGGGGCCACCTGGGAATCTTTCACC 

ATCACCGTCATCCTGGCCACGTATCTCATGTGCCGAATGTGGGCCTCCACCACCACCACCAC 

CCCCGCCACACCCCTCACCACCTCCACCACCACCACCACCCCCACCGCCACCATCCCCGCCA 

CGCTCGCTGAGGCTGCTGTCGCCGGTGCCTGTGGACAGCAGCTGCCCCTGCCCTCCCATCTG 

TTCCCAGGACAAGTGGACCCCATGTTTCCATGTGGAAGGATGCATCTCTGGGGTGAACGAGG 

GGAACAATAGACTGGGGCTTGCTCCAGCTGCATTTGCATGGCATGCCCCAGTGTACTATGGC 

AGCAGAGAATGGAGGAACACTGGGTCTGCAGTGCTGAAGGGTTTGGGGAGTGGAGAGCAAGG 

GTGCTCTTTCGGGGCTGGACAGCCCGTCTTGTGACAGTGACTCCCAGTGAGCCCCAGAAATG 

ACAAGCGTGTCTTGGCAGAGCCAGCACACAAGTGGATGTGAAGTGCCCGTCTTGACCTCCTC 

ATCAGGCTGCTGCAGGCCTCTGGCGGGCAGGGCACTGGGAGAGGCCCTGAGAATGTCCTTTT 

GGTTTGGAGAAGGCAGTGTGAGGCTGCACAGTCAATTCATCGGTGCCTTAGTCCAAGAAAAT 

AAAAACCACTAAGAAGCTTTAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 49 

MLGLLGSTALVGWITGAAVAVLLLLLLLATCLFHGRQDCDVERNRTAAGGNRVRRAQPWPFR 
RRGHLG I FHHHRHPGHVSHVPNVGLHHHHHPRHT PHHLHHHHHPHRHH PRHAR 
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FIGURE 50 

GGCGGCTGCTGAGCTGCCTTGAGGTGCAGTGTTGGGGATCCAGAGCCATGTCGGACCTGCTA 

CTACTGGGCCTGATTGGGGGCCTGACTCTCTTACTGCTGCTGACGCTGCTGGCCTTTGCCGG 

GTACTCAGGGCTACTGGCTGGGGTGGAAGTGAGTGCTGGGTCACCCCCCATCCGCAACGTCA 

CTGTGGCCTACAAGTTCCACATGGGGCTCTATGGTGAGACTGGGCGGCTTTTCACTGAGAGC 

TGCAGCATCTCTCCCAAGCTCCGCTCCATCGCTGTCTACTATGACAACCCCCACATGGTGCC 

CCCTGATAAGTGCCGATGTGCCGTGGGCAGCATCCTGAGTGAAGGTGAGGAATCGCCCTCCC 

CTGAGCTCATCGACCTCTACCAGAAATTTGGCTTCAAGGTGTTCTCCTTCCCGGCACCCAGC 

CATGTGGTGACAGCCACCTTCCCCTACACCACCATTCTGTCCATCTGGCTGGCTACCCGCCG 

TGTCCATCCTGCCTTGGACACCTACATCAAGGAGCGGAAGCTGTGTGCCTATCCTCGGCTGG 

AGATCTACCAGGAAGACCAGATCCATTTCATGTGCCCACTGGCACGGCAGGGAGACTTCTAT 

GTGCCTGAGATGAAGGAGACAGAGTGGAAATGGCGGGGGCTTGTGGAGGCCATTGACACCCA 

GGTGGATGGCACAGGAGCTGACACAATGAGTGACACGAGTTCTGTAAGCTTGGAAGTGAGCC 

CTGGCAGCCGGGAGACTTCAGCTGCCACACTGTCACCTGGGGCGAGCAGCCGTGGCTGGGAT 

GACGGTGACACCCGCAGCGAGCACAGCTACAGCGAGTCAGGTGCCAGCGGCTCCTCTTTTGA 

GGAGCTGGACTTGGAGGGCGAGGGGCCCTTAGGGGAGTCACGGCTGGACCCTGGGACTGAGC 

CCCTGGGGACTACCAAGTGGCTCTGGGAGCCCACTGCCCCTGAGAAGGGCAAGGAGTAACCC 

ATGGCCTGCACCCTCCTGCAGTGCAGTTGCTGAGGAACTGAGCAGACTCTCCAGCAGACTCT 

CCAGCCCTCTTCCTCCTTCCTCTGGGGGAGGAGGGGTTCCTGAGGGACCTGACTTCCCCTGC 

TCCAGGCCTCTTGCTAAGCCTTCTCCTCACTGCCCTTTAGGCTCCCAGGGCCAGAGGAGCCA 



ACAGTGGAGCTTCCAGGACCCAGAATAAAGCCAATGATTTACTTGTTTCACCTGGAAAAAAA 
AAAAAAAAAA 
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FIGURE 51 

MSDLLLLGLI GGLTLLLLLTLLAFAGYSGLLAGVEVSAGS PP I RNVTVAYKFHMGL YGETGR 
LFTESCSISPKLRSIAVYYDNPHMVPPDKCRCAVGSILSEGEESPSPELIDLYQKFGFKVFS 
FPAPSHWTATFP YTT I LS I WLATRRVH PALDTY I KERKLCAYPRLE I YQEDQ IHFMCPLAR 
QGDFYVPEMKETEWKWRGLVEAIDTQVDGTGADTMSDTSSVSLEVSPGSRETSAATLSPGAS 
SRGWDDGDTRSEHSYSESGASGSSFEELDLEGEGPI/3ESRLDPGTEPLGTTKWLWEPTAPEKGKE 
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CCGCGGGAACGCTGTCCTGGCTGCCGCCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCT 
GCCCCGCGCCCAGTCATGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCT 
GCTGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGA 
CCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGA 
GACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCT 
GACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGA 
GTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTAT 
GGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCT 
GATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAG 
GGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAAT 
AGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAGCAAAAAGAAATA 
ATAAATAATAAATTTTAAAAAACTTAAAAAAAAAAAAAAAAAA 
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FIGURE S3 

MTLRPSLLPLHLLLLLLLSAAVCRAEAGLETESPVRTLQVETLVEPPEPCAEPAAFGDTLHI 
HYTGSLVDGR I IDTSLTRDPLV I ELGQKQV I PGLEQSLLDMCVGEKRRAI I PSHLAYGKRGF 
PPSVPADAWQYDVEL IALI RANYWLKLVKGI LPLVGMAMVPALLGLIGYHLYRKANRPKVS 
KKKLKEEKRNKSKKK 
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CCCGGGAACGTGTTCCTGGCTGCCGCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCTGC 

CCCGCGCCCAGTCATGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCTGC 

TGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGACC 

CTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGAGA 

CACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCTGA 

CCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGAGT 

CTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTATGG 

AAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCTGA 

TTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAGGG 

ATGGCCATGGTGCCACCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAATAGA 

CCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAGCAAAAAGAAATAATA 
AATAATAAATTTTAAAAAACTTA 
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FIGURE 55 

CCGAAAGTCCCGTCCGGACCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCC 
GAGCCCGCTGCTTTTGGAGACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACG 
TATTATTGACACCTCCCTGACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGA 
TTCCAGGTCTGGAGCAGAGTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATT 
CCTTCTCACTTGGCCTATGGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGT 
GCAGTATGACGTGGAGCTGATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGG 
GCATTTTGCCTCTGGTAGGGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCAC 
CTATACAGAAAGGCCAATAGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAA 
CAAGAGCAAAAAGAAATAATAAATAATAAATTTTAAAAAACTTAAAA 
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FIGURE 56 

CTGCTGCATCCGGGTGTCTGGAGGCTGTGGCCGTTTTGTTTTCTTGGCTAAAATCGGGGGAG 
TGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACCACTGCACGACGGGGCTGGACTG 
ACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGG 
GAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGAT 
TATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCT 
GTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGA 
GGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGG 
TTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTG 
CTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATCTTT 
TTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGACTTATGGCAGTGAACACATCTGAT 
TTCCCACAGCACAACAGCCCTGCATGGGTTTGTTTGTTTTTTTACTGCTCACTCCCAACCTT 
TTGTAATGCCATTTTCTAAACTTATTTCTGAGTGTAGTCTCAGCTTAAAGTTGTGTAATACT 
AAAATCACGAGAACACCTAAACAACAACCAAAAATCTATTGTGGTATGCACTTGATTAACTT 
ATAAAATGTTAGAGGAAACTTTCACATGAATAATTTTTGTCAAATTTTATCATGGTATAATT 
TGTAAAAATAAAAAGAAATTACAAAAGAAATTATGGATTTGTCAATGTAAGTATTTGTCATA 
TCTGAGGTCCAAAACCACAATGAAAGTGCTCTGAAGATTTAATGTGTTTATTCAAATGTGGT 
CTCTTCTGTGTCAAATGTTAAATGAAATATAAACATTTTTTAGTTTTTAAAATATTCCGTGG 
TCAAAATTCTTCCTCACTATAATTGGTATTTACTTTTACCAAAAATTCTGTGAACATGTAAT 
GTAACTGGCTTTTGAGGGTCTCCCAAGGGGTGAGTGGACGTGTTGGAAGAGAGAAGCACCAT 
GGTCCAGCCACCAGGCTCCCTGTGTCCCTTCCATGGGAAGGTCTTCCGCTGTGCCTCTCATT 
CCAAGGGCAGGAAGATGTGACTCAGCCATGACACGTGGTTCTGGTGGGATGCACAGTCACTC 
CACATCCACCACTG 
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FIGURE 57 

MSGFLEGLRCSECIDWGEKRNTIASIAAGVLFFTGWWIIIDAAVIYPTMKDFNHSYHACGVI 
ATIAFLMINAVSNGQWGDSYSEGCLGQTGARIWLFVGFMLAFGSLIASMWILFGGYVAKEK 
DIVYPGIAVFFQNAFIFFGGLVFKFGRTEDLWQ 
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TTCTTGGCTAAAATCGGGGGAGTGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACC 
ACTGCACGACGGGGCTGGACTGACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATG 
CTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTAC 
TATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGAT 
TTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGC 
AGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTG 
CTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGG 
ATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATT 
TTTCCAGAATGCCTTCATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGC 
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TGGACGGACCTGAAAAAAATGTTTGGATTTNTAGAGGGNTTGAGATGTTCAGAATGCATGAC 
TGGGGGAAAAGCGCAAATACTATTGCTTCCATTGCTGCTGGTGTANTATTTTTTACAGGCTG 
GTGGATTATCATAGATGCAGNTGTTATTTATCCCACCATGAAAGATTTCAACCANTCATACC 
ATGCCTGTGGTGTTATAGCAACCATAGCCTTCNTAATGATTAATGCAGTATCGAATGGACAA 
GTCCGAGGTGATAGTTACAGTGAAGGTTGTTTGGGTCAAACAGGTGCTCGCATTTGGCTTTT 
CGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTT 
ATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGNTGTATTTTTCCAGAATGCCTTC 
ATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGANTTATGGCAGTG 
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FIGURE 60 



GGACACCGGGTTCCGGACCAATGC^USrGACGGGGTGGANTGACCTGAAAAAAATGTTTGGATT 
TTTAGAGGGCTTGAGATGNTCAGAATGCATTGACTGGGGGAAAAGCGCAATANTATTGCTTT 
CCATTGCTGCTGGTGTACTATTTTTTACAGGGTGGTGGATTATCATAGATGCAGCTGTTATT 
TATCCCACCATGAAAGATTTNAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGC 
CTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTT 
GTTTGGGTCAAACAGGTGNTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATTT 
CTGATTGNATTCTATGCGGATTCTTCTTGGAGGTTATGTTGCTAAAGi\AAAAGACATAGTAT 
ACCCTGGAATTNCTNTATTTTTCCAGAATGCC 
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FIGURE 61 

TAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATANTATTGCTTCC 
ATTGNTGNTGGTGTANTATTTTTTTACAGGCTGGTGGATTATNATAGATGCAGCTGTTATTT 
ATCCCACCATGAAAGATTTNAACCANTCATACCATGCCTGTGGTGTTATAGCAACCATAGCC 
TTCCTAATGATTAATGCAGTATNGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTG 
TTTGGGTCAAACAGGTGNTNGCATTTGGCTTTTNGTTGGTTTCATGTTGGCCTTTGGATCTN 
TGATTGCATTTATGTGGATTNTTTTTGGAGGTTATGTTGCTAAAGNAAAAGACATAGTATAC 
CCTGT 
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FIGURE 62 

GGGAGGCTGTGNCCGTTTTGTTTTNTTGGCTAAAATCGGGGGAGTGAGGCGGCCCGGCGCGG 
CGNGACACCGGGTTCCGGGAACCATTGCACGACGGGGTGGACTGACCTGAAAAAAATGTTTG 
GATTTNTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATT 
GCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGT 
TATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCA 
TAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAA 
GGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGG 
ATNTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAG 
TATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATNTTTTTTGGAGGGCTG 
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FIGURE 63 

Sotggctg^ctgc^ 

ACCAAGGGAGGGGGCTGCTGTCATACCTTTGGAAAGAATGGCCTTGAATTTGACACAGG^T 

cca™cat^ggcgtatggaagagggcagcattggccgttttatcttggaccagatcact 

aag^cagctggactgggctcccctgtcctctccttttgacatcatggtactggaagg^ 

aa^ccgaaaggagtaccccatgtacagtggagagaaagcctacattcagggcctcaagg^^ 

r^arcC^^ATGCCATCCTGTTGAAATTCCTCCCATTGCCCGTGGTTCAGCTCCTCGAC 

agotg?S§c?gctgaScg^^^ 

GGTCCTGCAGCAGCTGGGGGCCTCCTCTGAGCTCCAGGCAGTACTCAGCTACATCTTCCCCA 

SacggtgtcS 

ATGAAAGGAGGCT^TATCCCCGAGGGGGTTCCAGTGAAATTGCCTTCCA 

cStcgS^ 

cccatcgtggtctccaacgcaggactgttcaacacctatgaacacctactgccgggga^ 

rrGCTGCCTGCCAGGTGTGAAGCAGCAACTGGGGACGGTGCGGCCCGGCTTAGGCATGACCT 

GTTTACTATGACACGGACATGGACCAGGCGATGGAGCGCTACGTCTCCATGCCCAGGG 
?rrTOCGGAACACATCCCTCTTCTCTTCTTCGCTTTCCCATCAGCCAAAGATCCGACCTGGG 

aggIccga^ccSgg^ 

rArrAGTGGCAGGCGGAGCTGAAGGGAAAGCGGGGCAGTGACTATGAGACCTTCAAAAACTC 
CTTTGTGGAAGCCTCTATGTCAGTGG^^ 

SIrTGTOACTGCAGGATCCCCACTCACCAACCAGTTCTATCTGGCTGCTCCCCGAGGTGCC 
TG^TACGGGGCTGACCATGACCTGG^CCGCCTG 

S™CGCOTTOTAACATGCCATCCCTACTAATAGGATATT(^ 
TCTCATGACGAGCGGCGCTCTGCATCCCTCACCCATGCCTCCTAACTCAGTGATCAAAG^ 

I?Ittccatctgtggatagaacccctggcagtc 

TrTCCTGAGGCTTCTGCTCTCATTCATTTAGTGCTACGCTGCACAGTTCTACACTGTCAAGG 

gSSgSSaSaatgaScttaactcaa^ 

GOT^GGAGAGCTCTAGATCTCTTTTGTGCTGGGTTCAGTGGCTCTTCAGGGGACAGGAAAT 
GCCTGTGTCTGGCCAGTGTGGTTCTGGAGCTTTGGGGTAACAGCAGGATCCATCAGTTAGTA 

Sg?g?atgtcagatga^ 

TCGGGGTGGCAGCTGGTTCTCAATGTGCCAGCAGGGACTCAGTACC^^ 

???a?ccacca1Kt^ 

TGGACT^TAAGATGAATACTTTGCTGGGCTGAAGCAGGCTGCAG 

CACAGCAGGG^AC^GTGCAGGGAGGTGTGGGGTAAGGGAGGG 

AAGCCACGGAATGTGTGTGAAGCCCAGAAATGGCATTTGCAGTTAATTAGCACA 

™gacagg^ggtgaatgcaagctcaaggtttggaaaaatgacttttcagt^^ 

GTATCAGACATACGAAAGGTCTCT 
ATTCCATTGCTTTAAAAAAAAAAAAAAA 
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FIGURE 64 

IWLPLVLLLAVLLLAVLCKVYLGLFSGSSPNPFSEDVKRPPAPLVTDKEARKKVLKQAFSAN 
QVPEKLDVWI GSGFGGLAAAA I LAKAG KRVLVLEQHTKAGGCCHTFGKNGLEFDTG I HY I G 
RMEEGSIGRFILDQITEGQLDWAPLSSPFDIMVLEGPNGRKEYPMYSGEKAYIQGLKEKFPQ 
EEAIIDKYIKLVKWSSGAPHAILLKFLPLPWQLLDRCGLLTRFSPFLQASTQSLAEVLQQ 
LGAS SELQAVLS YI FPTYGVTPNHSAFSMHALLVNHYMKGGFYPRGGSSEI AFHTI PVIQRA 
GGAVLTKATVQSVLLDSAGKACGVSVKKGHELVNIYCPIWSNAGLFNTYEHLLPGNARCLP 
GVKQQLGTVRPGLGMTSVF I CLRGTKEDLHLPSTNYYVYYDTDMDQAMERYVSMPREEAAEH 
IPLLFFAFPSAKDPTWEDRFPGRSTMIMLIPTAYEWFEEWQAELKGKRGSDYETFKNSFVEA 
SMSVVLKLFPQLEGKVESVTAGSPLTNQFYLAAPRGACYGADHDLGRLHPCVMASLRAQSPI 
PNLYLTGQDI FTCGLVGALQGALLCSSAI LKRNLYSDLKNLDSRI RAQKKKN 
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FIGURE 65 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCTA 
GGGGTTGGCACCGGCCCCGAGAGGAGGATGCGGGTCCGGATAGGGCTGACGCTGCTGCTGTG 
TGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGGATGAAGAAGGCAGCCAGGATGAATCCT 
TAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTAAAGGACCATACTACTGCAGGCAGA 
GTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGAATTAGAATCCTCTATTCAAGA 
AGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAGAAGATATCAGCTTTCTAG 
AGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTACGGAAACCAGCTTTG 
ACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCTTTTCCTAGATAA 
GGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTGCTACAACCT 
ATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCTAAGAGA 
CGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAGCAA 
TAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATC 
CAGGCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGC 
TCTTGGCTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTAT 
ATTATACATTTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT 
TAGTGGAAGGCTAATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTT 
TCAGCTTTCATGATCCAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAA 
ATTCTTGTTAATGGATATAACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACA 
ATTTTTCTTTAAAATGATTAGTTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGC 
TCTTTTTAAATTTTCTCTGAGTTGGAATTGTCAGAATCATTTTTTACATTAGATTATCATAA 
TTTTAAAAATTTTTCTTTAGTTTTTCAAAATTTTGTAAATGGTGGCTATAGAAAAACAACAT 
GAAATATTATACAATATTTTGCAACAATGCCCTAAGAATTGTTAAAATTCATGGAGTTATTT 
GTGCAGAATGACTCCAGAGAGCTCTACTTTCTGTTTTTTACTTTTCATGATTGGCTGTCTTC 
CCATTTATTCTGGTCATTTATTGCTAGTGACACTGTGCCTGCTTCCAGTAGTCTCATTTTCC 
CTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGGAAGATTAACTCATTTTTAATAAA 
ATTATGTCTAAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 66 

MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLD 
SEESELESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHG 
EPCHFPFLFLDKEYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMM 
YQTGMKILNGSNKKSQKREAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEK 
LTEEGSPKGQTALGFLYASGLGVNSSQAKALVYYTFGALGGNLIAHMVLVSRL 
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FIGURE 67 

CTTCCCAGCCCTGTGCCCCAAAGCACCTGGAGCATATAGCCTTGCAGAACTTCTACTTGCCT 
GCCTCCCTGCCTCTGGCCATGGCCTGCCGGTGCCTCAGCTTCCTTCTGATGGGGACCTTCCT 
GTCAGTTTCCCAGACAGTCCTGGCCCAGCTGGATGCACTGCTGGTCTTCCCAGGCCAAGTGG 
CTCAACTCTCCTGCACGCTCAGCCCCCAGCACGTCACCATCAGGGACTACGGTGTGTCCTGG 
TACCAGCAGCGGGCAGGCAGTGCCCCTCGATATCTCCTCTACTACCGCTCGGAGGAGGATCA 
CCACCGGCCTGCTGACATCCCCGATCGATTCTCGGCAGCCAAGGATGAGGCCCACAATGCCT 
GTGTCCTCACCATTAGTCCCGTGCAGCCTGAAGACGACGCGGATTACTACTGCTCTGTTGGC 
TACGGCTTTAGTCCCTAGGGGTGGGGTGTGAGATGGGTGCCTCCCCTCTGCCTCCCATTTCT 
GCCCCTGACCTTGGGTCCCTTTTAAACTTTCTCTGAGCCTTGCTTCCCCTCTGTAAAATGGG 
TTAATAATATTCAACATGTCAACAAC 
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FIGURE 68 

MACRCLSFLLMGTFLSVSQTVLAQLDALLVFPGQVAQLSCTLSPQHWIRDYGVSWYQQRA 
SAPRYLLYYRSEEDHHRPADIPDRFSAAKDEAHNACVLTISPVQPEDDADYYCSVGYGFSP 
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FIGURE 69 

AAGGTGGTCCCACGGCAGGTGGCCCGGCTGGGC^^ 

GGGGGACCCGCCGCCGCTGACCATGTGGACCAAGGATGGCCG^CCAT^ACAGC^CTGGA 

GCCGCTTCCGCGTGCTGCCGCAGGGGCTGAAGGTGAAGCAGGTCGAGCG^AGGATO 

GTGTACGTGTGCAAGGCCACCAACGGCTTCGGCAGCCTGA^ 

GCTGGATGACATTAGCCCAGGGAAGGAGAGCCTGGGGCCCGACAGCTCCTCTGGGGG^ 
AGGACCCCGCCAGCCAGCAGTGGGCACGACCGCGCTTCACAC^ 

CGGGTGATCGCACGGCCCGTGGGTAGCTCCGTGCGGCTCAAGTGCGTGGCCAGCG^ 

tcggcccgacatcacgtggatgaaggacgaccaggccttgacgcgSgagg 
ccaggaagaagaagtgga^ctgagcctgaagaaccVgcggccggag^ 
acctgccgcgtgtcgaaccgcgcgggcgccatcaacgc^cctacaa^ 
gcggacccgttccaagcccgtgctcacaggcacgcaccccg^^^ 

GGGGGACCACGTCCTTCCAGTGCAAGGTGCGCAGCGAOT^^^ 

AAGCGCGTGGAGTACGGCGCCGAGGGCCGCCACAACTCCACCATC^ 

S™^ TGCTGCCCACGG ^^ 

TSHI G ^ CACCCGTG CCCGCCAGGACGATGCGGGCATGTACATCTGCS^^ 
ATGGGCTACAGCTTCCGCAGCGCCTTCCTCACCGTGCTGCCAGACCCAAi^C^CGCCArrrrr 

S^ G ^ GGCOTCCTC ^^ 

CCGGCGCTGTCTTCATCCTGGGCACCCTGCTCCTGTGGCTTTGCC^ 

tgcacccccgcgcctgcccctcccctgcctgggcaccgcccgcc^^ 

CAGCGGAGACAAGGACCTTCCCTCGTTGGCCGCCCTCAGCGCTGGCOT 
GTGAGGAGCATCX^TCTCCGGCAGCC^ 

CCTAAGTTGTACCCCAAACTCTACACAGACATCCACACACACACAC^ 
ACA F CACACGTGGAGGG CAAGG^^ 

ATCTGCAGTC^GCACGGGGGGGCCGGCCAGACAGGCAGACTGGGAGGAT^ 
S CAGACGAAGG CAGGGGACCCATGGCGAGGAGGAATGGCCAGCACC^^ 

TGAGGCATAGCCCCTGGACACACACACACAGACACACACACTACCTGGATGCATOT 

ACACATGCGCGCACACGTGCTCCCTGAAGGCACACGTACGCACACG^ 

CCGCCTGGGCACACAGATAAGCTGCCCAAATGCACGCACAC^CAGAGACATG^ 

tacaaggacatgctgcctgaacatacacacgcacacc^tgcgcaS 

CACACACACACACGGATATGCTGTCTGGACGCACA^A^TGCA^ 

ATATTGCCTGGA(^CACACACACACACACGCGTGCACAGATATGCTGTCT^ 

GCCTGGACACACGCAGATATGCTGTCTAGTCACACACACACGCAGA^ 

acacaosc^tgcacagatatgctgtccggacacacacacgcacg^ 

J CACACACAGATAA TGCTGCCTCAACACTCACACACGTGCA^AT^^^ 
J G ^ A £ AGATATGCTGTC ^ 

cacgcacac^tgcagatatgctgcctgggcacacacttccggacacacatS^ 

GCAGATATGCTGCCTGGACACIACACACAGATAATGCTGCCTCAACAC^ 
TATTGCCTGGACACACACATGTGCACAGATATGCTGTCTGGACATCcI^ 

tgctgtccggatacac^cgcacgcacacatgcagatatgctS 



JSJS g J ctctcctggggcccgggacccgc ctggtctt™ 

gtccaggccagacaccaccccccaccccactgtcgtggtggc^cagatctctc 
tgtagagtttgagctgaagccccgtatatttaatt^ 
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MTPSPLLLLLLPPLLLGAFPPAAAARGPPKMADKWPRQVARLGRTVRLQCPVEGDPPPLTM 

WTKDGRTIHSGWSRFRVLPQGLKVKQVEREDAGVYVCKATNGFGSLSVNYTLWLDDISPGK 

ESLGPDSSSGGQEDPASQQWARPRFTQPSKMRRRVIARPVGSSVRLKCVASGHPRPDITWMK 

DDQALTRPEAAEPRKKKWTLSLKNLRPEDSGKYTCRVSNRAGAINATYKVDVIQRTRSKPVL 

TGTHPVNTTVDFGGTTSFQCKVRSDVKPVIQWLKRVEYGAEGRHNSTIDVGGQKFWLPTGD 

VWSRPDGSYLNKLLITRARQDDAGMYICLGANTMGYSFRSAFLTVLPDPKPPGPPVASSSSA 

TSLPWPWIGIPAGAVFILGTLLLWLCQAQKKPCTPAPAPPLPGHRPPGTARDRSGDKDLPS 

LAALSAGPGVGLCEEHGSPAAPQHLLGPGPVAGPKLYPKLYTDIHTHTHTHSHTHSHVEGKV 
HQHIHYQC 
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S G S^ G S AGTGAGGCCAGCG ^GAGGGTCAGCACTGCATGGGC^ 



r C ?? C ^S^ AAGACTTGTGCAGGGCAGCAGGAT AATGG^ 

CCCATCACTTTGGAAGCCATGGAGACCAACATCATCCCCCTGG^^ 
S C ^ A I GGCTGAACTGGAGATTCCATCC AGGAGTTTCTAC^ 

^? AAAAGTGAAGGCCAGTC ^^ 

• S CGGAGA S TGACCTGAACTTCATCAATGACG AAGGAGACA^ 
CATGTTCTCTGTGGACTTCAGAGATGAGGTCACCT(^GAG^^ 

====== 

GCGATGGAATTCA(^GCCTGCTATATCAAGGTGA^ 
GCGA J CC ^ CAACAT ^^ 

ATGTGAGGAGCACTCGGGACAGGGACCAGCCCAATGTCTCAGCTGCCT^ 
™™ AGC ? GCCGTCGAGCCAGT ^ 



WO 99/63088 



FIGURE 71 R 



PCT/US99/12252 



====== 

CATTAATGC^TATACTTCCTCTTCTTTTTGCATGGm^ 
AATCTGATGCTGAAGATCAAATAACCAATATAAAGCATA^ 

c?ctgtaS?gaaaaa? tagaaacccagtgtagccca gggcagatgtcaataaatgcata 
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MVGTKAWVFSFLVLEVTSVLGRQTMLTQSVRRVQPGKKNPS I FAKPADTLES PGEWTTWFNI 
DYPGGKGDYERLDAIRFYYGDRVCARPLRLEARTTDWTPAGSTGQWHGSPREGFWCLNREQ 
RPGQNCSNYTVRFLCPPGSLRRDTERIWSPWSPWSKCSAACGQTGVQTRTRICLAEMVSLCS 
EASEEGQHCMGQDCTACDLTCPMGQVNADCDACMCQDFMLHGAVSLPGGAPASGAAIYLLTK 
TPKLLTQTDSDGRFR I PGLCPDGKS I LKITKVKFAP I VLTMPKTSLKAATIKAEFVRAETPY 

MVmPETKARRAGQSVSLCCKATGKPRPDKYFWYHNDTLLDPSLYKHESKLVLRKLQQHQAG 

EYFCKAQSDAGAVKSKVAQLIVTASDETPCNPVPESYLIRLPHDCFQNATNSFYYDVGRCPV 

KTCAGQQDNGIRCRDAVQNCCGISKTEEREIQCSGYTLPTKVAKECSCQRCTETRSIVRGRV 

SAADNGEPMRFGHVYMGNSRVSMTGYKGTFTLHVPQDTERLVLTFVDRLQKFVNTTKVLPFN 

KKGSAVFHEIKMLRRKEPITLEAMETNIIPLGEWGEDPMAELEIPSRSFYRQNGEPYIGKV 

KASVTFLDPRNISTATAAQTDLNFINDEGDTFPLRTYGMFSVDFRDEVTSEPLNAGKVKVHL 

DSTQVKMPEHISTVKLWSLNPDTGLWEEEGDFKFENQRRNKREDRTFLVGNLEIRERRLFNL 

DVPESRRCFVKVRAYRSERFLPSEQIQGWISVINLEPRTGFLSNPRAWGRFDSVITGPNGA 

CVPAFCDDQSPDAYSAYVLASLAGEELQAVESSPKFNPNAIGVPQPYLNKLNYRRTDHEDPR 

VKKTAFQISMAKPRPNSAEESNGPIYAFENLRACEEAPPSAAHFRFYQIEGDRYDYNTVPFN 

EDDPMSWTEDYLAWWPKPMEFRACYIKVKIVGPLEVNVRSRNMGGTHRRTVGKLYGIRDVRS 

TRDRDQPNVSAACLEFKCSGMLYDQDRVDRTLVKVIPQGSCRRASVNPMLHEYLVNHLPLAV 

NNDTSEYTMLAPLDPLGHNYGIYTVTDQDPRTAKEIALGRCFDGTSDGSSRIMKSNVGVALT 

FNCVERQVGRQSAFQYLQSTPAQSPAAGTVQGRVPSRRQQRASRGGQRQGGWASLRFPRVA 
QQPLIN 
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FIGURE 73 

CTGCAAGTTGTTAACGCCTAACACACAAGTATGTTAGGCTTCCACCAAAGTCCTCAATATAC 

CTGAATACGCACAATATCTTAACTCTTCATATTTGGTTTTGGGATCTGCTTTGAGGTCCCAT 

CTTCATTTAAAAAAAAATACAGAGACCTACCTACCCGTACGCATACATACATATGTGTATAT 

ATATGTAAACTAGACAAAGATCGCAGATCATAAAGCAAGCTCTGCTTTAGTTTCCAAGAAGA 

TTACAAAGAATTTAGAGATGTATTTGTCAAGATCCCTGTCGATTCATGCCCTTTGGGTTACG 

GTGTCCTCAGTGATGCAGCCCTACCCTTTGGTTTGGGGACATTATGATTTGTGTAAGACTCA 

GATTTACACGGAAGAAGGGAAAGTTTGGGATTACATGGCCTGCCAGCCGGAATCCACGGACA 

TGACAAAATATCTGAAAGTGAAACTCGATCCTCCGGATATTACCTGTGGAGACCCTCCTGAG 

ACGTTCTGTGCAATGGGCAATCCCTACATGTGCAATAATGAGTGTGATGCGAGTACCCCTGA 

GCTGGCACACCCCCCTGAGCTGATGTTTGATTTTGAAGGAAGACATCCCTCCACATTTTGGC 

AGTCTGCCACTTGGAAGGAGTATCCCAAGCCTCTCCAGGTTAACATCACTCTGTCTTGGAGC 

AAAACCATTGAGCTAACAGACAACATAGTTATTACCTTTGAATCTGGGCGTCCAGACCAAAT 

GATCCTGGAGAAGTCTCTCGATTATGGACGAACATGGCAGCCCTATCAGTATTATGCCACAG 

ACTGCTTAGATGCTTTTCACATGGATCCTAAATCCGTGAAGGATTTATCACAGCATACGGTC 

TTAGAAATCATTTGCACAGAAGAGTACTCAACAGGGTATACAACAAATAGCAAAATAATCCA 

CTTTGAAATCAAAGACAGGTTCGCGCTTTTTGCTGGACCTCGCCTACGCAATATGGCTTCCC 

TCTACGGACAGCTGGATACAACCAAGAAACTCAGAGATTTCTTTACAGTCACAGACCTGAGG 

ATAAGGCTGTTAAGACCAGCCGTTGGGGAAATATTTGTAGATGAGCTACACTTGGCACGCTA 

CTTTTACGCGATCTCAGACATAAAGGTGCGAGGAAGGTGCAAGTGTAATCTCCATGCCACTG 

TATGTGTGTATGACAACAGCAAATTGACATGCGAATGTGAGCACAACACTACAGGTCCAGAC 

TGTGGGAAATGCAAGAAGAATTATCAGGGCCGACCTTGGAGTCCAGGCTCCTATCTCCCCAT 

CCCCAAAGGCACTGCAAATACCTGTATCCCCAGTATTTCCAGTATTGGTACGAATGTCTGCG 

ACAACGAGCTCCTGCACTGCCAGAACGGAGGGACGTGCCACAACAACGTGCGCTGCCTGTGC 

CCGGCCGCATACACGGGCATCCTCTGCGAGAAGCTGCGGTGCGAGGAGGCTGGCAGCTGCGG 

CTCCGACTCTGGCCAGGGCGCGCCCCCGCACGGCACCCCAGCGCTGCTGCTGCTGACCACGC 

TGCTGGGAACCGCCAGCCCCCTGGTGTTCTAGGTGTCACCTCCAGCCACACCGGACGGGCCT 

GTGCCGTGGGGAAGCAGACACAACCCAAACATTTGCTACTAACATAGGAAACACACACATAC 

AGACACCCCCACTCAGACAGTGTACAAACTAAGAAGGCCTAACTGAACTAAGCCATATTTAT 

CACCCGTGGACAGCACATCCGAGTCAAGACTGTTAATTTCTGACTCCAGAGGAGTTGGCAGC 

TGTTGATATTATCACTGCAAATCACATTGCCAGCTGCAGAGCATATTGTGGATTGGAAAGGC 

TGCGACAGCCCCCCAAACAGGAAAGACAAAAAACAAACAAATCAACCGACCTAAAAACATTG 

GCTACTCTAGCGTGGTGGGCCCTAGTACGACTCCGCCCAGTGTGTGGACCAACCAAATAGCA 

TTCTTTGCTGTCAGGTGCATTGTGGGCATAAGGAAATCTGTTACAAGCTGCCATATTGGCCT 

GCTTCCGTCCCTGAATCCCTTCCAACCTGTGCTTTAGTGAACGTTGCTCTGTAACCCTCGTT 

GGTTGAAAGATTTCTTTGTCTGATGTTAGTGATGCACATGTGTAACAGCCCCCTCTAAAAGC 

GCAAGCCAGTCATACCCCTGTATATCTTAGCAGCACTGAGTCCAGTGCGAGCACACACCCAC 

TATACAAGAGTGGCTATAGGAAAAAAGAAAGTGTATCTATCCTTTTGTATTCAAATGAAGTT 

ATTTTTCTTGAACTACTGTAATATGTAGATTTTTTGTATTATTGCCAATTTGTGTTACCAGA 

CAATCTGTTAATGTATCTAATTCGAATCAGCAAAGACTGACATTTTATTTTGTCCTCTTTCG 

TTCTGTTTTGTTTCACTGTGCAGAGATTTCTCTGTAAGGGCAACGAACGTGCTGGCATCAAA 

GAATATCAGTTTACATATATAACAAGTGTAATAAGATTCCACCAAAGGACATTCTAAATGTT 

TTCTTGTTGCTTTAACACTGGAAGATTTAAAGAATAAAAACTCCTGCATAAACGATTTCAGG 

AATTTGTATTGCAATTTCTTAAGATGAAAGGAACAGCCACCAAGCAGTTTCACACTCACTTT 

ACTGATTTCTGTGTGGACTGAGTACATTCAGCTGACGAATTTAGTTCCCAGGAAGATGGATT 

GATGTTCACTAGCTTGGACAACTTCTGCAAAATATGAGACTATTTCCACTTGGGAAAAATTA 
CAACAGCAAAAAAAAAAAAAAAAAAAAAA 
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MYLSRSLS IHALWVTVSSVMQPyPLVWGHYDLCKTQ I YTEEGKVWDYMACQPESTDMTKYLK 
VKLDPPDITCGDPPETFCAMGNPYMCNNECDASTPELAHPPELMFDFEGRHPSTFWQSATWK 
EYPKPLQVNITLSWSKTI ELTDNI VI TFESGRPDQM ILEKSLDYGRTWQPYQYYATDCLDAF 
HMDPKSVKDLSQHTVLEI ICTEEYSTGYTTNSKI IHFE I KDRFALFAGPRLRNMASLYGQLD 
TTKKLRDFFTVTDLRI RLLRPAVGE I FVDELHLARYFYAI SD I KVRGRCKCNLHATVCVYDN 
SKLTCECEHNTTGPDCGKCKKNYQGRPWSPGS YLPI PKGTANTCI PS I SSIGTNVCDNELLH 
CQNGGTCHNNVRCLCPAAYTG I LCEKLRCEEAGS CGSDSGQGAPPHGTPALLLLTTLLGTAS 
PLVF 
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CCCACGCGTCCGGGTGACCTGGGCCGAGCCCTCCCGGTCGGCTAAGATTGCTGAGGAGGCGG 

CGGGTAGCTGGCAGGCGCCGACTTCCGAAGGCCGCCGTCCGGGCGAGGTGTCCTCATGACTT 

CTCTTGTGGACCATGTCCGTGATCTTTTTTGCCTGCGTGGTACGGGTAAGGGATGGACTGCC 

CCTCTCAGCCTCTACTGATTTTTACCACACCCAAGATTTTTTGGAATGGAGGAGACGGCTCA 

AGAGTTTAGCCTTGCGACTGGCCGAGTATCCAGGTCGAGGTTCTGCAGAAGGTTGTGACTTT 

AGTATACATTTTTCTTCTTTCGGGGACGTGGCCTGCATGGCTATCTGCTCCTGCCAGTGTCC 

AGCAGCCATGGCCTTCTGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCCTATGACA 

CTACCTGCATTGGCCTAGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAG 

AAAGTGAAGTGGCATTTTAACTATGTAAGTTCCTCTCAGATGGAGTGCAGCTTGGAAAAAAT 

TCAGGAGGAGCTCAAGTTGCAGCCTCCAGCGGTTCTCACTCTGGAGGACACAGATGTGGCAA 

ATGGGGTGATGAATGGTCACACACCGATGCACTTGGAGCCTGCTCCTAATTTCCGAATGGAA 

CCAGTGACAGCCCTGGGTATCCTCTCCCTCATTCTCAACATCATGTGTGCTGCCCTGAATCT 

CATTCGAGGAGTTCACCTTGCAGAACATTCTTTACAGGATCCAAGGAGCTGGTTCTGCTGGT 

TGGACCAAACCTCGTGAGCCAGCCACCCCTGACCCAAATGAGGAGAGCTCTGATTCTCCCAT 

CCGGGAGCAGTGATGTCAAACTTCTGCTGCTGGGGAAATCTCATCAGCAGGGAGCCTGTGGA 

AAAGGGCATGTCAGTGAAATCTGGGAATGGCTGGATTCGGAAACATCTGCCCATGTGTATTG 

ATGGCAGAGCTGTTGCCCACAAGCGCCTTTTATTTAGGGTAAAATTAACAAATCCATTCTAT 

TCCTCTGACCCATGCTTAGTACATATGACCTTTAACCCTTACATTTATATGATTCTGGGGTT 

GCTTCAGAAGTGTTATTTCATGAATCATTCATATGATTTGATCCCCCAGGATTCTATTTTGT 

TTAATGGGCTTTTCTACTAAAAGCATAAAATACTGAGGCTGATTTAGTCAGGGCAAAACCAT 

TTACTTTACATATTCGTTTTCAATACTTGCTGTTCATGTTACACAAGCTTCTTACGGTTTTC 

TTGTAACAATAAATATTTTGAGTAAATAATGGGTACATTTTAACAAACTCAGTAGTACAACC 

TAAACTTGTATAAAAGTGTGTAAAAATGTATAGCCATTTATATCCTATGTATAAATTAAATG 

AGGTGGCTTCAGAAATGGCAGAATAAATCTAAAGTGTTTATTAAAAAAAAAAAAAAAAAAAA 
AAAAG 
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FIGURE 76 

MSVIFFACWRVRDGLPLSASTDPYHTQDFLEWRRRLKSLALRLAQYPGRGSAEGCDFSIHF 

SSFGDVACMAICSCQCPAAMAFCFLETLWWEFTASYDTTCIGLASRPYAFLEFDSIIQKVKW 

HFNYVSSSQMECSLEKIQEELKLQPPAVLTLEDTDVANGVMNGHTPMHLEPAPNFRMEPVTA 
LGILSLILNIMCAALNLIRGVHLAEHSLQDPRSWFCWLDQTS 
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FIGURE 77 

TGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCNTATGACACTACCTGCATTGGCNT 
AGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAGAAAGTGAAGTGGCATT 
TTAACTATGTAAGTTCCTNTCAGATGGAGTGCAGCTTGGAAAAAATTCAGGAGGAGCTCAAG 
TTGCAGCCTCCAGCGGTTCTCANTATGGAGGACACAGATGTGGCAAATGGGGT 
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FIGURE 78 

CTCAGCGGCGCTTCCTCGTAGCGAGCCTAGTGGCGGGTGTTTGCATTGAAACGTGAGCGCGA 

CCCGACCTTAAAGAGTGGGGAGCAAAGGGAGGACAGAGCCCTTTAAAACGAGGCGGGTGGTG 

CCTGCCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTT 

TCTGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGGTGCTTGGCGGCGGCGGCTT 

CCTCCCCGCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTA 

TGGAAGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGC 

GAGTGTATTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGAC 

CCGCTTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGA 

TTGCGCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCC 

TTCTCCATCATCAGCAATGAGGTGCTGCTCTCCCTGCCTCGGAACTACTACATCCAGTGGCT 

CAACGGCTCCCTCATCCATGGCCTCTGGAACCTTGTTTTTCTCTTCCCCAACCTGTCCCTCA 

TCTTCCTCATGCCCTTTGCATATTTCTTCACTGAGTCTGAGGGCTTTGCTGGCTCCAGAAAG 

GGTGTCCTGGGCCGGGTCTATGAGACAGTGGTGATGTTGATGCTCCTCACTCTGCTGGTGCT 

AGGTATGGTGTGGGTGGCATCAGCCATTGTGGACAAGAACAAGGCCAACAGAGAGTCACTCT 

ATGACTTTTGGGAGTACTATCTCCCCTACCTCTACTCATGCATCTCCTTCCTTGGGGTTCTG 

CTGCTCCTGGTGTGTACTCCACTGGGTCTCGCCCGCATGTTCTCCGTCACTGGGAAGCTGCT 

AGTCAAGCCCCGGCTGCTGGAAGACCTGGAGGAGCAGCTGTACTGCTCAGCCTTTGAGGAGG 

CAGCCCTGACCCGCAGGATCTGTAATCCTACTTCCTGCTGGCTGCCTTTAGACATGGAGCTG 

CTACACAGACAGGTCCTGGCTCTGCAGACACAGAGGGTCCTGCTGGAGAAGAGGCGGAAGGC 

TTCAGCCTGGCAACGGAACCTGGGCTACCCCCTGGCTATGCTGTGCTTGCTGGTGCTGACGG 

GCCTGTCTGTGCTCATTGTGGCCATCCACATCCTGGAGCTGCTCATCGATGAGGCTGCCATG 

CCCCGAGGCATGCAGGGTACCTCCTTAGGCCAGGTCTCCTTCTCCAAGCTGGGCTCCTTTGG 

TGCCGTCATTCAGGTTGTACTCATCTTTTACCTAATGGTGTCCTCAGTTGTGGGCTTCTATA 

GCTCTCCACTCTTCCGGAGCCTGCGGCCCAGATGGCACGACACTGCCATGACGCAGATAATT 

GGGAACTGTGTCTGTCTCCTGGTCCTAAGCTCAGCACTTCCTGTCTTCTCTCGAACCCTGGG 

GCTCACTCGCTTTGACCTGCTGGGTGACTTTGGACGCTTCAACTGGCTGGGCAATTTCTACA 

TTGTGTTCCTCTACAACGCAGCCTTTGCAGGCCTCACCACACTCTGTCTGGTGAAGACCTTC 

ACTGCAGCTGTGCGGGCAGAGCTGATCCGGGCCTTTGGGCTGGACAGACTGCCGCTGCCCGT 

GTCCGGTTTCCCCCAGGCATCTAGGAAGACCCAGCACCAGTGACCTCCAGCTGGGGGTGGGA 

AGGAAAAAACTGGACACTGCCATCTGCTGCCTAGGCCTGGAGGGAAGCCCAAGGCTACTTGG 

ACCTCAGGACCTGGAATCTGAGAGGGTGGGTGGCAGAGGGGAGCAGAGCCATCTGCACTATT 

GCATAATCTGAGCCAGAGTTTGGGACCAGGACCTCCTGCTTTTCCATACTTAACTGTGGCCT 

CAGCATGGGGTAGGGCTGGGTGACTGGGTCTAGCCCCTGATCCCAAATCTGTTTACACATCA 

ATCTGCCTCACTGCTGTTCTGGGCCATCCCCATAGCCATGTTTACATGATTTGATGTGCAAT 

AGGGTGGGGTAGGGGCAGGGAAAGGACTGGGCCAGGGCAGGCTCGGGAGATAGATTGTCTCC 

CTTGCCTCTGGCCCAGCAGAGCCTAAGCACTGTGCTATCCTGGAGGGGCTTTGGACCACCTG 

AAAGACCAAGGGGATAGGGAGGAGGAGGCTTCAGCCATCAGCAATAAAGTTGATCCCAGGGA 
AAAAAA 
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FIGURE 79 

ME APDY EVLS VREQLFHER I REC 1 1 STLLFATLY I LCH I FLTRFKKPAE FTTVDDEDATVNK 
IALELCTFTLAIALGAVLLLPFSIISNEVLLSLPRNYYIQWLNGSLIHGLWNLVFLFPNLSL 
IFLMPFAYFFTESEGFAGSRKGVLGRVYETVVMLMLLTLLVLGMVWVASAIVDKNKANRESL 
YDFWEYYLPYLYSCISFLGVLLLLVCTPLGLARMFSVTGKLLVKPRLLEDLEEQLYCSAFEE 
AALTRRICNPTSCWLPLD^LLHRQVLALQTQRVLLEKRRKASAWQRNLGYPLAMLCLLVLT 
GLSVLIVAIHILELLIDEAAMPRGMQGTSLGQVSFSKLGSFGAVIQWLIFYLMVSSWGFY 
SSPLFRSLRPRWHDTAMTQIIGNCVCLLVLSSALPVFSRTLGLTRFDLLGDFGRFNWLGNFY 
IVFLYNAAFAGLTTLCLVKTFTAAVRAELIRAFGLDRLPLPVSGFPQASRKTQHQ 
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GGCTGCCGAGGGAAGGCCCCTTGGGTTGGTCTTGGTTGCTTGGCGGCGGCGGNTTCNTCCCC 
GCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGAAGC 
ACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGTGTA 
TTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGACCCGCTTC 
AAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCG 



v 
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GACCGACCTTAAAGAGTGGGAGCAAAGGGAGGACAGAGCCTTTTAAAACGAGGCGGTGGTGC 

CTGCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTTTC 

TGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGTGCTTGGCGGCGGCGGCTTCCT 

CCCCGTTGTCNTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGA 

AGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGT 

GTATTATATCAACACTTCTGTTTGCAACACTGTACATCNTCTGCCACATCTTCCTGACCCGC 

TTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGATTGC 

GCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCCTTCT 
CCATCATCAGCAATGAGGTGCTGCACTCCC 
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GATGTGCTCCTTGGAGCTGGTGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTT 

GGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTGGTGGTCCAGGTCTTCATGCTGCTGT 

GGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAGGCCCATT 

ATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAA 

GGGATTTCGCTTCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAA 

TACTAAGAGAAACCCCAGACAATATCCTTGAGGTTCAGGAATCTGGAGAGTACAGATGCCAG 

GCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTCAGAGATGGGATTTCC 

TCATGCTGCCCAGGCTAATGTTGAACTCCTGGGCTCAAGTGATCTGCTCACCTAGGCCTCTC 

AAAGCGCTGGGATTACAGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGAC 

TCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAGTAACACTGAATAATACTATTTACAAGAA 

TGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 83 

MLLWVILLVLAPVSGQFARTPRPIIFLQPPWTTVFQGERVTLTCKGFRFYSPQKTKWYHRYL 
GKEILRETPDNILEVQESGEYRCQAQGSPLSSPVHLDFSSEMGFPHAAQANVELLGSSDLLT 
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FIGURE 84 

CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCGGACCAGGGAGACCCCCGCGCCCCCCCGGTGT 

GAGGCGGCCTCACAGGGCCGGGTGGGCTGGCGAGCCGACGCGGCGGCGGAGGAGGCTGTGAG 

GAGTGTGTGGAACAGGACCCGGGACAGAGGAACCATGGCTCCGCAGAACCTGAGCACCTTTT 

GCCTGTTGCTGCTATACCTCATCGGGGCGGTGATTGCCGGACGAGATTTCTATAAGATCTTG 

GGGGTGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAACTAGCCCTGCA 

GCTTCATCCCGACCGGAACCCTGATGATCCACAAGCCCAGGAGAAATTCCAGGATCTGGGTG 

CTGCTTATGAGGTTCTGTCAGATAGTGAGAAACGGAAACAGTACGATACTTATGGTGAAGAA 

GGATTAAAAGATGGTCATCAGAGCTCCCATGGAGACATTTTTTCACACTTCTTTGGGGATTT 

TGGTTTCATGTTTGGAGGAACCCCTCGTCAGCAAGACAGAAATATTCCAAGAGGAAGTGATA 

TTATTGTAGATCTAGAAGTCACTTTGGAAGAAGTATATGCAGGAAATTTTGTGGAAGTAGTT 

AGAAACAAACCTGTGGCAAGGCAGGCTCCTGGCAAACGGAAGTGCAATTGTCGGCAAGAGAT 

GCGGACCACCCAGCTGGGCCCTGGGCGCTTCCAAATGACCCAGGAGGTGGTCTGCGACGAAT 

GCCCTAATGTCAAACTAGTGAATGAAGAACGAACGCTGGAAGTAGAAATAGAGCCTGGGGTG 

AGAGACGGCATGGAGTACCCCTTTATTGGAGAAGGTGAGCCTCACGTGGATGGGGAGCCTGG 

AGATTTACGGTTCCGAATCAAAGTTGTCAAGCACCCAATATTTGAAAGGAGAGGAGATGATT 

TGTACACAAATGTGACAATCTCATTAGTTGAGTCACTGGTTGGCTTTGAGATGGATATTACT 

CACTTGGATGGTCACAAGGTACATATTTCCCGGGATAAGATCACCAGGCCAGGAGCGAAGCT 

ATGGAAGAAAGGGGAAGGGCTCCCCAACTTTGACAACAACAATATCAAGGGCTCTTTGATAA 

TCACTTTTGATGTGGATTTTCCAAAAGAACAGTTAACAGAGGAAGCGAGAGAAGGTATCAAA 

CAGCTACTGAAACAAGGGTCAGTGCAGAAGGTATACAATGGACTGCAAGGATATTGAGAGTG 

AATAAAATTGGACTTTGTTTAAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTTTTT 

TTGTGTGTGTTTTTGTTTTTATTTTCAATATGCAAGTTAGGCTTAATTTTTTTATCTAATGA 

TCATCATGAAATGAATAAGAGGGCTTAAGAATTTGTCCATTTGCATTCGGAAAAGAATGACC 

AGCAAAAGGTTTACTAATACCTCTCCCTTTGGGGATTTAATGTCTGGTGCTGCCGCCTGAGT 

TTCAAGAATTAAAGCTGCAAGAGGACTCCAGGAGCAAAAGAAACACAATATAGAGGGTTGGA 

GTTGTTAGCAATTTCATTCAAAATGCCAACTGGAGAAGTCTGTTTTTAAATACATTTTGTTG 
TTATTTTTA 
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MAPQNLSTFCLLLLYLIGAVIAGRDFYKILGVPRSASIKDIKKAYRKLALQLHPDRNPDDPQ 
AQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHFFGDFGFMFGGTPRQQ 
DRNI PRGSD 1 1 VDLEVTLEEVYAGNFVE WRNKPVARQAPGKRKCNCRQEMRTTQLGPGRFQ 
MTQEWCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFIGEGEPHVDGEPGDLRFRIKWKH 
PI FERRGDDLYTNVTI SLVESLVGFEMD ITHLDGHKVH I SRDK I TRPGAKLWKKGEGLPNFD 
NNNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQKVYNGLQGY 



WO 99/63088 



FIGURE 86 



PCT/US99/12252 



TGGGACCAGGGAACCCCGGGCCCCCCGGTGGAGNGCCTAACAGGCCGGTGGNTGCGACCGAA 

GCGGCGGGCGGAGGAGGTTTTGAGGATTTTTGGAACAGGACCCGGACAGAGGAACCATGGTT 

CCGCAGAACNTGAGCACNTTTTGCCTGTTGNTGNTATACTTCATCGGGGCGGTGATTGCCGG 

ACGAGATTTNTATAAGATTTTGGGGTGCCTNGAAGTGCCTTNTATAAAGGATATTAAAAAGG 

CCTATAGGAAACTAGCCCTGCAGNTTTATCCCGACCGGAACCCTGATGATCCACAAGCCCAG 

GAGAAATTCCAGGATTTGGGTGCTGCTTATGAGGTTNTGTCAGATAGTGAGAAACGGAAACA 

GTACGATAATTATGGTGAAGAAGGATTAAAAGATGGTNATCAGAGCTCCCATGGAGACATTT 

TTTCACACTTNTTTGGGGATTTTGGTTTCATGTTTGGAGGAACCCCTNGTCAGCAAGACAGA 
AATATTCCAAGAG 
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GGCACGAGGCGGCGGGGCAGTCGCGGGATGCGCCCGGGAGCCACAGCCTGAGGCCCTCAGGT 

CTCTGCAGGTGTCGTGGAGGAACCTAGCACCTGCCATCCTCTTCCCCAATTTGCCACTTCCA 

GCAGCTTTAGCCCATGAGGAGGATGTGACCGGGACTGAGTCAGGAGCCCTCTGGAAGCATGG 

AGACTGTGGTGATTGTTGCCATAGGTGTGCTGGCCACCATCTTTCTGGCTTCGTTTGCAGCC 

TTGGTGCTGGTTTGCAGGCAGCGCTACTGCCGGCCGCGAGACCTGCTGCAGCGCTATGATTC 

TAAGCCCATTGTGGACCTCATTGGTGCCATGGAGACCCAGTCTGAGCCCTCTGAGTTAGAAC 

TGGACGATGTCGTTATCACCAACCCCCACATTGAGGCCATTCTGGAGAATGAAGACTGGATC 

GAAGATGCCTCGGGTCTCATGTCCCACTGCATTGCCATCTTGAAGATTTGTCACACTCTGAC 

AGAGAAGCTTGTTGCCATGACAATGGGCTCTGGGGCCAAGATGAAGACTTCAGCCAGTGTCA 

GCGACATCATTGTGGTGGCCAAGCGGATCAGCCCCAGGGTGGATGATGTTGTGAAGTCGATG 

TACCCTCCGTTGGACCCCAAACTCCTGGACGCACGGACGACTGCCCTGCTCCTGTCTGTCAG 

TCACCTGGTGCTGGTGACAAGGAATGCCTGCCATCTGACGGGAGGCCTGGACTGGATTGACC 

AGTCTCTGTCGGCTGCTGAGGAGCATTTGGAAGTCCTTCGAGAAGCAGCCCTAGCTTCTGAG 

CCAGATAAAGGCCTCCCAGGCCCTGAAGGCTTCCTGCAGGAGCAGTCTGCAATTTAGTGCCT 

ACAGGCCAGCAGCTAGCCATGAAGGCCCCTGCCGCCATCCCTGGATGGCTCAGCTTAGCCTT 

CTACTTTTTCCTATAGAGTTAGTTGTTCTCCACGGCTGGAGAGTTCAGCTGTGTGTGCATAG 

TAAAGCAGGAGATCCCCGTCAGTTTATGCCTCTTTTGCAGTTGCAAACTGTGGCTGGTGAGT 

GGCAGTCTAATACTACAGTTAGGGGAGATGCCATTCACTCTCTGCAAGAGGAGTATTGAAAA 

CTGGTGGACTGTCAGCTTTATTTAGCTCACCTAGTGTTTTCAAGAAAATTGAGCCACCGTCT 

AAGAAATCAAGAGGTTTCACATTAAAATTAGAATTTCTGGCCTCTCTCGATCGGTCAGAATG 

TGTGGCAATTCTGATCTGCATTTTCAGAAGAGGACAATCAATTGAAACTAAGTAGGGGTTTC 

TTCTTTTGGCAAGACTTGTACTCTCTCACCTGGCCTGTTTCATTTATTTGTATTATCTGCCT 

GGTCCCTGAGGCGTCTGGGTCTCTCCTCTCCCTTGCAGGTTTGGGTTTGAAGCTGAGGAACT 

ACAAAGTTGATGATTTCTTTTTTATCTTTATGCCTGCAATTTTACCTAGCTACCACTAGGTG 

GATAGTAAATTTATACTTATGTTTCCCTCAAAAAAAAAAAAAAA 
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FIGURE S8 

METVVIVAIGVLATIFLASFAALVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPSEL 
ELDD WI TNPH I EAI LENEDWI EDASGLMSHC I AI LK I CHTLTEKL VAMTMGSGAKMKTS AS 

VSDIIWAKRISPRVDDWKSMYPPLDPKLLDARTTALLLSVSHLVLVTRNACHLTGGLDWI 
DQSLSAAEEHLEVLREAALASEPDKGLPGPEGFLQEQSAI 
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FIGURE 89 

GCTTCATTTCTCCCGACTCAGCTTCCCACCCTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCC 
CCACCACTGCAGCCATGATCTCCTTAACGGACACG^^ 

TTTGGAGTGTTTTTCCTGTTCTTTGGAATGATTCTCTTTTTTGACAAAGCACTACTGGCTAT 

TGGAAATGTTTTATTTGTAGCCGGCTTGGCTTTTGTAATTGGTTTAGAAAGAACATTCAGAT 

TCTTCTTCCAAAAACATAAAATGAAAGCTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTC 

CTTATTGGTTGGCCTTTGATAGGCATGATCTTCGAAATTTATGGATTTTTTCTCTTGTTCAG 

GGGCTTCTTTCCTGTCGTTGTTGGCTTTATTAGAAGAGTGCCAGTCCTTC 

TTACCTGGAATTAGATCATTTGTAGATAAAGTTGGAGAAAGCAACAATATGGTATAACAACA 

AGTGAATTTGAAGACTCATTTAAAATATTGTGTTATTTATAAAGTCATTTGAAGAATATTCA 

GCACAAAATTAAATTACATGAAATAGCTTGTAATGTTCTTTACAGGAGTTTAAAACGTATAG 

CCTACAAAGTACCAGCAGCAAATTAGCAAAGAAGCAGTGAAAACAGGCTTCTACTCAAGTGA 

ACTAAGAAGAAGTCAGCAAGCAAACTGAGAGAGGTGAAATCCATGTTAATGATGCTTAAGAA 

ACTCTTGAAGGCTATTTGTGTTGTTTTTCCACAATGTGCGAAACTCAGCCATCCTTAGAGAA 

CTGTGGTGCCTGTTTCTTTTCTTTTTATTTTGAAGGCTCAGGAGCATCCATAGGCATTTGCT 

TTTTAGAAGTGTCCACTGCAATGGCAAAAATATTTCCAGTTGCACTGTATCTCTGGAAGTGA 

TGCATGAATTCGATTGGATTGTGTCATTTTAAAGTATTAAAACCAAGGAAACCCCAATTTTG 

ATGTATGGATTACTTTTTTTTGNGCNCAGGGCC 
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FIGURE 90 

MISLTDTQKIGMGLTGFGVFFLFFGMILFFDKALLAIGNVLFVAGLAFVIGLERTFRFFFQK 
HKMKATGFFLGGVFVVLIGWPLIGMIFEIYGFFLLFRGFFPVVVGFIRRVPVLGSLLNLPGI 
RSFVDKVGESNNMV 

Important features : 
Transmembrane domains: 

amino acids 12-30 (typell) , 33-52, 69-89 and 93-109 

N-myristoylation sites. 

amino acids 11-16, 51-56 and 116-121 

Aminoacyl- transfer RNA synthetases class-II protein. 

amino acids 4 9-59 
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FIGURE 91 

GAAGACGTGGCGGCTCTCGCCTGGGCTGTTTCCCGGCTTCATTTCTCCCGACTCAGCTTCCC 
ACCNTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCCCCACCACTGCAGCCATGATCTCCTTAA 
CGGACACGCAGAAAATTGGAATGGGATTAACCGGATTTGGAGTGTTTTTCCTGTTCTTTGGA 
ATGATTCTCTTTTTTGACAAAGCACTACTGGCTATTGGAAATGTTTTATTTGTAGCCGGCTT 
GGCTTTTGTAATTGGTTTAGAAAGAACATTCAGATTCTTCTTCCAAAAACATAAAATGAAAG 
CTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTCCTTATTGGTTGGCCTTTGATAGGCATG 
ATCTTCGAAATTTATGGATTTTTTCTCTTGTTC 
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FIGURE 92 
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GGCACGAGGCTGAACCCAGCCGGCTCCATCTCAGCTTCTGGTTTCTAAGTCCATGTGCCAAA 
GGCTGCCAGGAAGGAGACGCCTTCCTGAGTCCTGGATCTTTCTTCCTTCTGGAAATCTTTGA 
CTGTGGGTAGTTATTTATTTCTGAATAAGAGCGTCCACGCATCATGGACCTCGCGGGACTGC 
TGAAGTCTCAGTTCCTGTGCCACCTGGTCTTCTGCTACGTCTTTATTGCCTCAGGGCTAATC 
ATCAACACCATTCAGCTCTTCACTCTCCTCCTCTGGCCCATTAACAAGCAGCTCTTCCGGAA 
GATCAACTGCAGACTGTCCTATTGCATCTCAAGCCAGCTGGTGATGCTGCTGGAGTGGTGGT 
CGGGCACGGAATGCACCATCTTCACGGACCCGCGCGCCTACCTCAAGTATGGGAAGGAAAAT 
GCCATCGTGGTTCTCAACCACAAGTTTGAAATTGACTTTCTGTGTGGCTGGAGCCTGTCCGA 
ACGCTTTGGGCTGTTAGGGGGCTCCAAGGTCCTGGCCAAGAAAGAGCTGGCCTATGTCCCAA 
TTATCGGCTGGATGTGGTACTTCACCGAGATGGTCTTCTGTTCGCGCAAGTGGGAGCAGGAT 
CGCAAGACGGTTGCCACCAGTTTGCAGCACCTCCGGGACTACCCCGAGAAGTATTTTTTCCT 
GATTCACTGTGAGGGCACACGGTTCACGGAGAAGAAGCATGAGATCAGCATGCAGGTGGCCC 
GGGCCAAGGGGCTGCCTCGCCTCAAGCATCACCTGTTGCCACGAACCAAGGGCTTCGCCATC 
ACCGTGAGGAGCTTGAGAAATGTAGTTTCAGCTGTATATGACTGTACACTCAATTTCAGAAA 
TAATGAAAATCCAACACTGCTGGGAGTCCTAAACGGAAAGAAATACCATGCAGATTTGTATG 
TTAGGAGGATCCCACTGGAAGACATCCCTGAAGACGATGACGAGTGCTCGGCCTGGCTGCAC 
AAGCTCTACCAGGAGAAGGATGCCTTTCAGGAGGAGTACTACAGGACGGGCACCTTCCCAGA 
GACGCCCATGGTGCCCCCCCGGCGGCCCTGGACCCTCGTGAACTGGCTGTTTTGGGCCTCGC 
TGGTGCTCTACCCTTTCTTCCAGTTCCTGGTCAGCATGATCAGGAGCGGGTCTTCCCTGACG 
CTGGCCAGCTTCATCCTCGTCTTCTTTGTGGCCTCCGTGGGAGTTCGATGGATGATTGGTGT 
GACGGAAATTGACAAGGGCTCTGCCTACGGCAACTCTGACAGCAAGCAGAAACTGAATGACT 
GACTCAGGGAGGTGTCACCATCCGAAGGGAACCTTGGGGAACTGGTGGCCTCTGCATATCCT 
CCTTAGTGGGACACGGTGACAAAGGCTGGGTGAGCCCCTGCTGGGCACGGCGGAAGTCACGA 
CCTCTCCAGCCAGGGAGTCTGGTCTCAAGGCCGGATGGGGAGGAAGATGTTTTGTAATCTTT 
TTTTCCCCATGTGCTTTAGTGGGCTTTGGTTTTCTTTTTGTGCGAGTGTGTGTGAGAATGGC 
TGTGTGGTGAGTGTGAACTTTGTTCTGTGATCATAGAAAGGGTATTTTAGGCTGCAGGGGAG 
GG CAGGGCTGGGGACCGAAGGGGACAAGTTCCCCTTTCATCCTTTGGTGCTGAGTTTTCTGT 
AACCCTTGGTTGCCAGAGATAAAGTGAAAAGTGCTTTAGGTGAGATGACTAAATTATGCCTC 
CAAGAAAAAAAAATTAAAGTGCTTTTCTGGGTCAAAAAAAAAAAA 
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FIGURE 93 

^IJ^GLLKSQFLCHLVFCYVFIASGLIINTIQLFTLLLWPINKQLFRKINCRLSYCISSQLV 
MLLEWWSGTECTIFTDPRAYLKYGKENAIVVLNHKFEIDFLCGWSLSERFGLLGGSKVIA^ 
ELAYVPIIGWMWYFTEIWFCSRKWEQDRKTVATSLQHLRDYPEKYFFLIHCEGTRFTEKKHE 
ISMQVARAKGLPRLKHHLLPRTKGFAITWSLRNWSAWDCTLNFRNNENPTLLGVLNGKK 
YHADLYVRRIPLEDIPEDDDECSAWLHKLYQEKDAFQEEYYRTGTFPETPMVPPRRPWTLVN 
WLFWASLVLYPFFQFLVSMIRSGSSLTLASFILVFFVASVGVRWMIGVTEIDKGSAYGNSDS 
KQKLND 
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FIGURE 94 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCG 

GCGCACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAA 

GTAAAAGGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTA 

TACAATTGACATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAG 

GCGAAGTAAATGAGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGT 

TGGTACAAATTCCGTCGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAA 

AAACTTGCAGGAGCATTTTTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAA 

TAACAGAAAGCTGCTCTACTCATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTT 

TTTCACAGGGTACCTTTAGTGGTTGCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAAC 

TGTATCAGGTTCCTGTATGTCCACTGGTTTTAGCCGAGCAGTACAAACACACAGCTCTAAAT 

TTTTTGAAGAAGATGGATCCTTAAAGGAGGTACATAAGATAAATGAAATGTATGCTTCATTA 

CAAGAGGAATTAAAGAGTATATGCAAAAAAGTGGAAGACAGTGAACAAGCAGTAGATAAACT 

AGTAAAGGATGTAAACAGATTAAAACGAGAAATTGAGAAAAGGAGAGGAGCACAGATTCAGG 

CAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGAACATTTTTCTTTGTCAGGCATTA 

CGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTTATGTCTTTAAAAAATAGACA 

TGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGTAGACAATCTGACCTTAA 

TGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCACAAATCATTAAGCAT 

AAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTAGATACACAAGA 

C7VAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAATGAGCAGCC 

CAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTCCTACA 

TTTTGATCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAACAT 

TTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCAC 

CTGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAAC 

ATCAGATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCT 

TACACAGACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAA 

TCCCAGCACTTAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCC 

TGGGCAACGTATTGAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTAT 

TTTCAAAATATGGAAAGAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAG 

TGATACTTTTTTAGAAGTACATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCA 

AT AAATTTG C AA71ACATCAT CT AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 95 

MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGE VKGE AKNS I TDSQMDDVE WYT I D I Q 
KY I PCYQLFS F YNS SGEVNEQALKK I LSNVKKNWGWYKFRRHS DQ IMTFRERLLHKNLQEH 
FSNQDLVFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSC 
MSTGFSRAVQTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVN 
RLKRE I EKRRGAQ I QAAREKNI QKDPQENI FLCQALRTFFPNSEFLHS CVMSLKNRHVSKS S 
CNYNHHLD WDNLTLMVEHTD I PEASPASTPQI I KHKALDLDDRWQFKRSRLLDTQDKRSKA 
NTGSSNQDKASKMSSPETDEEIEKMKGFGEYSRSPTF 
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FIGURE 96 

GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCA 

GCGCAGGGCAGCCCAAGCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCC 

CCGCGGCGCCTGCCACCCTTCCCTCCTTCCCCGCGTCCCCGCCTCGCCGGCCAGTCAGCTTG 

CCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCTGCTTCCCTGGGCCG 

CGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCT 

GACGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCA 

ACTCCAACTCCTTCTCCCTCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCC 

GCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAAGGTGGGAACGCGTCCGCCCCGGCCCGCA 

CCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTCAGCGCCGCGCTG 

CTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGG 

CTTCAACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCC 

AGGGTTCTACCTGCTGCTCTCAAGAGATGGAGGAGAAGTACAGCCTGCAAAGTAAAGATGAT 

TTCAAAAGTGTGGTCAGCGAACAGTGCAATCATTTGCAAGCTGTCTTTGCTTCACGTTACAA 

GAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTGAATGATATGT 

TTGTGAAGACATATGGCCATTTATACATGCAAAATTCTGAGCTATTTAAAGATCTCTTCGTA 

GAGTTGAAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTG 

GGCTCGCCTCCTGGAGCGGATGTTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGT 

ATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAGCCCTTCGGAGATGTCCCTCGCAAA 

TTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGCTTAGCGGT 

TGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCC 

TGTTGAAGATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAAC 

TACTGCTCAAACATCATGAGAGGCTGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAA 

CAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTAGAGGGTCCTTTCAACATTGAAT 

CGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGATAATAGT 

GTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACG 

AATTTCTCGTTCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGG 

AACGCCCAACCACAGCAGCTGGCACTAGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAA 

CTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAACGTTTGCAACGATGAGAGGAT 

GGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTACCTGT 

TTGCAGTGACAGGAAATGGATTAGCCAACCAGGGCAACAACCCAGAGGTCCAGGTTGACACC 

AGCAAACCAGACATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGAT 

GAAGAATGCATACAATGGGAACGACGTGGACTTCTTTGATATCAGTGATGAAAGTAGTGGAG 

AAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAGTTTGACTACAATGCCACT 

GACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCACA 

GGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAAT 

TCTCAAACTCTGAGAAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTA 

CCATCCTAGTGACTTTGCTTTTTAAATGAATGGACAACAATGTACAGTTTTTACTATGTGGC 

CACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGGAGGAAAAGGGACTGTG 

CATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 

CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTC 

ATTTCGTTTGTGGGTTTTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAG 

GGTCCCTTCTTGGCACGTAACATGTACGTATTTCTGAAATATTAAATAGCTGTACAGAAGCA 

GGTTTTATTTATCATGTTATCTTATTAAAAGAAAAAGCCCAAAAAGC ~" ' 
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FIGURE 97 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEWRLWS 

GSTCCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMF 
VKTYGHLYMQNSELFKDLFVELKRYYWGNVNLEEMIoNDFWARLLERM 

LECVSKYTEQLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSVVNPTAQCTHAL 
LKMIYCSHCRGLVTVKPCYNYCSNIMRGCLANQGD^ 

VMDPIDVKISDAIMNMQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEE 
RPTTAAGTSLDRLVTDVKEKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLF 
AVTGNG LANQGNNP E VQ VDT S KPD I L I LRQ I MALRVMTS KMKNAYNGNDVDF FD I S DE S S GE 
GSGSGCEYQQCPSEFDYNATDHAGKSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 
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FIGURE 98 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCT 
GACCTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAA 
GCAACTTACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGC 
TGCCACTAATGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCAC 
AGGGACCGAGGCCAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAA 
AGATTGGTTCCTGAGAGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGC 
AGTGCCCCTGTGATCATTTCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGA 
AAGCCAAACAAGCATTCCAGAGCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTT 
TGCTCTGCCTTTGTAGGAGCTCTGAGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAA 
GACAGTGAGCACACCTACCAGACACTCTTCTTCTCCCACCTCACTCTCCGACTGTACCCACC 
CCTAAATCATTCCAGTGCTCTCAAAAAGCATGTTTTTCAAGATCATTTTGTTTGTTGCTCTC 
TCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCTGTGCCCTCCCCTTACCCAGGCTTAGGCTT 
AATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCTAGCTAGTGTCATTTAACCTTAAATGC 
AATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTTTAAAT^ 
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FIGURE 99 



MKVLISSLLLLLPLI^^S^SSSLNPGVARGHRDRGQASRRWLQEGGQECECKDW 
KFMTVSGLPKKQCPCDHFKGlSr^^TRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 
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FIGURE 100 



PCT/US99/12252 



AATGGCTGTCTTAGTACTTCGCCTGACAGTTGTCCTGGGACTGCTTGTCTTATTCCTGACCT 
GCTATGCAGACGACAAACCAGACAAGCCAGACGACAAGCCAGACGACTCGGGCAAAGACCCA 
AAGCCAGACTTCCCCAAATTCCTAAGCCTCCTGGGCACAGAGATCATTGAGAATGCAGTCGA 
GTTCATCCTCCGCTCCATGTCCAGGAGCACAGGATTTATGGAATTTGATGATAATGAAGGAA 
AACATTCATCAAAGTGACATCCTCAGGACACACCCATGTGGCTCCTGGACAATCCAAGAGCA 
GCCAAATCCTGCTTTTCCAGTTTGGCTCCACAAGTCCTCCAGGACAGAGCCCTCAAAGCAAC 
TCCCAACGAGTTCTCAGGATTCAGGCTCTGGCTTCAACCAAACAGAACTCATTTTGAACACC 
CTGACTGCATTTTTGCTTTTAGAAAGTTAGAATAAATATGGCGCTTTGGGATCACATAGTTG 
ATGGAGAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 101 

MAVLVLRLTVVLGLLVLFLTCYADDKPDKPDDKPDDSGKDPKPDFPKFLSLLGTEIIENAVE 
FILRSMSRSTGFMEFDDNEGKHSSK 
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FIGURE 102 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCT 

CAGAGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTAC 

CCTGCCCCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCC 

CCTACCTGATGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAG 

CTCTTCAGCCAGATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGG 

CTGCGGAACCGGAGCCAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACC 

CAAATCCCCACTTTGAGAAGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATAT 

GAGCGGTTTGTGGTGGCTCCTGGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGT 

GGTGGTCTGCACTCTGGTGCTGTGCTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCC 

GGAGAGTACTGAGACCGGGAGGTGTGCTCTTTTTCTGGGAGCATGTGGCAGAACCATATGGA 

AGCTGGGCCTTCATGTGGCAGCAAGTTTTCGAGCCCACCTGGAAACACATTGGGGATGGCTG 

CTGCCTCACCAGAGAGACCTGGAAGGATCTTGAGAACGCCCAGTTCTCCGAAATCCAAATGG 

AACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTGGGCCCCACATCATGGGAAAGGCTGTC 

AAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCCTTCCCCAGCCTCCAATTAGAACA 

AGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTAGCAGAATGAGAGAAGACATT 

CATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGCAATCTCTAACTTCAATC 

CCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGGAAACACTAGGACCC 

TGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTCCCAATGTTGTC 

CCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACACCCATGCGT 

CTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGACCCTCT 

CTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGGAT 

GCCAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATATTTTTTAATAAATAGACGA 

AACCACG 
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FIGURE 103 

MD I LVPLLQLLVIJjLTL^ 

KGLTGASGKVALLEI^CGTGANFQFYPPGCRVTCLDPNPHFEKFLTKS^4AENRHLQYERFVV 
APGEDMRQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHV 
WQQWEPTWK^IGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFP 
SSKALI CSFPSLQLEQATHQP I YLPLRGT 
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FIGURE 104 

GTGGGATTTATTTGAGTGCAAGATCGTTTTCTCAGTGGTGGTGGAAGTTGCCTCATCGCAGG 
CAGATGTTGGGGCTTTGTCCGAACAGCTCCCCTCTGCCAGCTTCTGTAGATAAGGGTTAAAA 
ACTAATATTTATATG ACAG AAGAAAAAGATGT C ATT C CGTAAAGTAAAC AT C ATCAT CTTGG 
TCCTGGCTGTTGCTCTCTTCTTACTGGTTTTGCACCATAACTTCCTCAGCTTGAGCAGTTTG 
TTAAGGAATGAGGTTACAGATTCAGGAATTGTAGGGCCTCAACCTATAGACTTTGTCCCAAA 
TGCTCTCCGACATGCAGTAGATGGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATCTG 
AAGACAGGCTTGGGGGGGCCATTGCAGCTATAAACAGCATTCAGCACAACACTCGCTCCAAT 
GTGATTTTCTACATTGTTACTCTCAACAATACAGCAGACCATCTCCGGTCCTGGCTCAACAG 
TGATTCCCTGAAAAGCATCAGATACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAA 
AAGTAAAGGAGGATCCTGACCAGGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTAC 
TTGCCAATTCTGGTTCCCAGCGCAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCA 
AGGTGAT ATTCT TG CCCTTTACAATAC AGCACTG AAG C C AGG AC ATG CAG CTGC ATTT TC AG 
AAGATTGTGATTCAGCCTCTACTAAAGTTGTCATCCGTGGAGCAGGAAACCAGTACAATTAC 
ATTGGCTATCTTGACTATAAAAAGGAAAGAATTCGTAAGCTTTCCATGAAAGCCAGCACTTG 
CTCATTTAATCCTGGAGTTTTTGTTGCAAACCTGACGGAATGGAAACGACAGAATATAACTA 
ACCAACTGGAAAAATGGATGAAACTCAATGTAGAAGAGGGACTGTATAGCAGAACCCTGGCT 
GGTAGCATCACAACACCTCCTCTGCTTATCGTATTTTATCAACAGCACTCTACCATCGATCC 
TATGTGGAATGTCCGCCACCTTGGTTCCAGTGCTGGAAAACGATATTCACCTCAGTTTGTAA 
AGGCTGCCAAGTTACTCCATTGGAATGGACATTTGAAGCCATGGGGAAGGACTGCTTCATAT 
ACTGATGTTTGGGAAAAATGGTATATTCCAGACCCAACAGGCAAATTCAACCTAATCCGAAG 
ATATACCGAGATCTCAAACATAAAGTGAAACAGAATTTGAACTGTAAGCAAGCATTTCTCAG 
GAAGTCCTGGAAGATAGCATGCATGGGAAGTAACAGTTGCTAGGCTTCAATGCCTATCGGTA 
GCAAGCCATGGAAAAAGATGTGTCAGCTAGGTAAAGATGACAAACTGCCCTGTCTGGCAGTC 
AGCTTCCCAGACAGACTATAGACTATAAATATGTCTCCATCTGCCTTACCAAGTGTTTTCTT 
ACTACAATGCTGAATGACTGGAAAGAAGAACTGATATGGCTAGTTCAGCTAGCTGGTACAGA 
TAATTCAAAACTGCTGTTGGTTTTAATTTTGTAACCTGTGGCCTGATCTGTAAATAAAACTT 
ACATTTTTC 
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FIGURE 105 



MSFRKVNI 1 1 L VLAVALFLL VLHHNFLS LS SL LRNE VTDS G I VG PQ P I DF VPNALRHAVDGR 
QEE I PWIAASEDRLGGAIAAINS IQHNTRSNVI FY I VTLNNTADHLRSWLNSDSLKS IRYK 
I VNFDPKLLEGKVKED PDQGESMKPLTFARF YLP I L VP SAKKAI YMDDDV I VQGD I LALYNT 
ALKPGHAAAFSEDCDSASTKWIRGAGNQYNYIGYLDYKKERIRKLSMKASTCSFNPGVFVA 
NLTEWKRQNITNQLEKWMKLWEEGLYSRTIAGSITTPPLLIVFYQQHSTIDPMWNVRHLGS 
SAGKRYS PQF VKAAKLLHWNGHLKPWGRTAS YTD WEKWY I PDPTGKFNL I RRYTE ISNIK 
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FIGURE 106 



PCT/US99/12252 



TGGTTTTTGCCCCATAAATTCCCTCAGCTTGAGCAGTTTGTTAAGGAATGAGGTTACAGATT 
CAGGAATTNTAGGNCCTCAACCTNTAGANTTTGTCCCAAATGTTCTCCGACATGCAGTAGAT 
GGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATNTGAAGACAGGCTTGGGGGGGCCAT 
TGCAGCTATAAACAGCATTCAGCACAACACTCGNTCCAATGTGATTTTCTACATTGTTACTC 
TCAACAATACAGCAGACCATNTCCGGTCCTGGNTCAACAGTGATTCCCTGAAAAGCATCAGA 
TACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCA 
GGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGTTCCCAGCG 
CAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTAC 
AATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTAC 
TAAAGTTGTCATCCGTGGAGCAGGAAA 
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FIGURE 107 

CGACGCTCTAGCGGTTACCGCTGCGGGCTGGCTGGGCGTAGTGGGGCTGCGCGGCTGCCACG 
GAGCTAGAGGGCAAGTGTGCTCGGCCCAGCGTGCAGGGAACGCGGGCGGCCAGACAACGGGC 
TGGGCTCCGGGGCCTGCGGCGCGGGCGCTGAGCTGGCAGGGCGGGTCGGGGCGCGGGCTGCA 
TCCGCATCTCCTCCATCGCCTGCAGTAAGGGCGGCCGCGGCGAGCCTTTGAGGGGAACGACT 
TGTCGGAGCCCTAACCAGGGGTGTCTCTGAGCCTGGTGGGATCCCCGGAGCGTCACATCACT 
TTCCGATCACTTCAAAGTGGTTAAAAACTAATATTTATATGACAGAAGAAAAAGATGTCATT 
CCGTAAAGTAAACATCATCATCTTGGTCCTGGGCTGTTGCTCTCTTCTTACTGGTTTTGCAC 
CATAACTTCCTCAGCTTGAGGCAGTTTGTTAAGGAATGAGGTTACAGATTCAGGAATTGTAG 
GGCCTCAACCTATAGGACTTTGTCCCAAATGCTCTCCGACATGCAGTAGATGGGAGACAAGA 
GGAGATTCCTGTGGTCATCGCTGCATCTGAAGACAGGCTTGGGGGGGCCATTGCAGCTATAA 
ACAGCATTCAGCACAACACTCGCTCCAATGTGATTTTCTACATTGTTACTCTCAACAATACA 
GCAGACCATCTCCGGTCCTGGGCTCAACAGTGATTCCCTGAAAAGCATCAGATACAAAATTG 
TCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCAGGGGGAATCC 
ATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGGTTCCCAGCGCAAAGAAGG 
CCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTACAATACAGCA 
CTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTACTAAAGTTGT 
CATCCGTGGAGCAGGAAACCAGTAGAATTACATTGGCTATCTTGACTATAAAAAGGAAAGAA 
TTCGTAAGCTTTCCATGAAAGCCAGCACTTGCTCATTTAATCCTGGAGTTTTTGTTGCAAAC 
CTGACGGAATGGAAACGACAGAATATAACTAACCAACTGGAAAAATGGATGAAACTCAATGT 
AGAAGAGGGACTGTATAGCAGAACCCTGGCTGGTAGCATCACAACACCTCCTCTGCTTATCG 
TATTTTATCAACAGCACTCTACCATCGATCCTATGTGGAATGTCCGCCACCTTGGTTCCAGT 
GCTGGAAAACGATATTCACCTCAGTTTGTAAAGGCTGCCAAGTTACTCCATTGGAATGGACA 
TTTGAAGCCATGGGGAAGGACTGCTTCATATACTGATGTTTGGGGAAAAATGGTATATTCCA 
GACCCAACAGGCAAATTCAACCTAATCCGAAGATATACCGAGATCTCAAACATAAAGTGAAA 
CAGAATTTGAACTGTAAGCAAGCATTTCTCAGGAAGTCCTGGAAGATAGCATGCGTGGGAAG 
TAACAGTTGCTAGGCTTCAATGCCTATCGGTAGCAAGCCATGGAAAAAGATGTGTCAGCTAG 
GTAAAGATGACAAACTGCCCTGTCTGGCAGTCAGCTTCCCAGACAGACTATAGACTATAAAT 
ATGTCTCCATCTGCCTTACCAAGTGTTTTCTTACTACAATGCTGAATGACTGGAAAGAAGAA 
CTGATATGGCTAGTTCAGCTAGCTGGTACAGATAATTCAAAACTGCTGTTGGTTTTAATTTT 
GTAACCTGTGG C CTG ATCTGTAAATAAAACTTACATTTTT CAAT AGGT AAAAAAAAAAAAAA 
AAAAAA 
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FTGIJRE 108 

CTGCAGGTAGACATCTCCACTGCCCAGGAATCACTGAGCGTGCAGACAGCACAGCCTCCTCT 

GAAGGCCGGCCATACCAGAGTCCTGCCTCGGCATGGGCCTCACCATTGAGGCAGCTCCACTG 

TCTGTGCTGGTCTGAGGGTGCTGCCTGTCATGGGGGCAGCCATCTCCCAGGGGGCCCTCATC 

GCCATCGTCTGCAACGGTCTCGTGGGCTTCTTGCTGCTGCTGCTCTGGGTCATCCTCTGCTG 

GGCCTGCCATTCTCGTCTGCCGACGTTGACTCTCTCTCTGAATCCAGTCCCAACTCCAGCCC 

TGGCCCCTGTCCTGAGAAGGCCCCACCACCCCAGAAGCCCAGCCATGAAGGCAGCTACCTGC 

TGCAGCCCTGAAGGCCCCTGGCCTAGCCTGGAGCCCAGGACCTAAGTCCACCTCACCTAGAG 

CCTGGAATTAGGATCCCAGAGTTCAGCCAGCCTGGGGTCCAGAACTCAAGAGTCCGCCTGCT 

TGGAGCTGGACCCAGCGGCCCAGAGTCTAGCCAGCTTGGCTCCAATAGGAGCTCAGTGGCCC 

TAAGGAGATGGGCCTGGGGTGGGGGCTTATGAGTTGGTGCTAGAGCCAGGGCCATCTGGACT 

ATGCTCCATCCCAAGGGCCAAGGGTCAGGGGCCGGGTCCACTCTTTCCCTAGGCTGAGCACC 

TCTAGGCCCTCTAGGTTGGGGAAGCAAACTGGAACCCATGGCAATAATAGGAGGGTGTCCAG 

GCTGGGCCCCTCCCCTGGTCCTCCCAGTGTTTGCTGGATAATAAATGGAACTATGGCTCTAA 



AAAAAAAAAAAAAAAAA 
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FIGURE 109 

MGAAISQGALIAIVCNGLVGFLLLLLWVILCWACHSRLPTLTLSLNPVPTPALAPVLRRPHH 
PRSPAMKAATCCSPEGPWPSLEPRT 
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FIGURE 110 

GTTTGAATTCCTTCT^CTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCC^GGCTACCA 
GTTCCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTA 
CTCCCTATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAA 
TCATCTCGGGAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCC 
ATGATGTTTACCTTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTAT 
TTTGGGATTGTTGTTTGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACC 
TCAGCATAGAATTGGACACAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTA 
TCCACAGGCATCACGGCAGTGCTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATT 
GACAGTTGAGCTTTTCCAAATCACAAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCC 
AGCCACTGTGGACATTTGCCATCCTCATTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTG 
AGCCTGGGAACTGCAGGAGCTGCCCAGGTTATGGAAGGCGGCCAAGTGGAATATAAGCCCCT 
TTCGGGCATTCGGTACATGTGGTCGTACCATTTAATTGGCCTCATCTGGACTAGTGAATTCA 
TCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAGTGGTTACTTGTTATTTCAACAGAAGT 
AAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTCTCCATTCTCTTCTTCTACCATCA 
AGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAGGATTCCGAGAATCATTGTCA 
TGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGTCCAGGTACCTGTTCCGA 
TGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTCAACCAGAATGCATA 
TACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGCATTCAAAATCT 
TGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAATTTTTCTA 
GGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTACAATCG 
GGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGCCC 
ATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGAT 
CTGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTT 
CGTAAAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGA 
ATGAGGAGGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCT 
GG AAAAGATTTCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTT 
AGTGAATTTTTTTTTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FTGIJRE 111 

MSGRDTI LGLC I LALALSLAMMFTFRF ITTLLVH I F I S LV I LGLLFVCGVLWWLYYD YTNDL 
S IELDTERENMKCVLGFAI VSTG I TAVLLVL I FVLRKR I KLTVELFQ I TNKAI S S APF LLFQ 
PLWTFAILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFI 

1ACQQMTIAGAWTCY FNRSKNDPPDHPILSSLS I LFFYHQGTWKGSFLIS WRI PRI I VM 
YMQNALKEQQHGALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKIL 
SKNS SHFTS INCFGDF 1 1 FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAH 
SFLSVFETVLDALFLCFAVDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRN 

EEGTELQAIVR 
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FIGURE 112 

GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTT 

AGAATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTG 

TGTGGTGAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATT 

T ATGA GGACTGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGG 

TGACTGGAGTACATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTG 

CCTCAGATCAACTGCGATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATG 

TCCAGCAGGATGCCAAGACCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACT 

CCAGTGTGTGTGGCGCTGCCGTACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTT 

GTTCGGAAGGTTGCTGGACAGTCTGGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTT 

ATCCCTACCACGATGGAGAGAATCCTTTATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAA 

CCTACCCATCAGCTCTTACATACTCATCATCGAAAAGTCCAGCTGCCCAAGCAGGTGAGACC 

ACAAAAGCCTATCAGAGGCCACCTATTCCAGGGACAACTGCACAGCCGGTCACTCTGATGCA 

GCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCACCACCTTGCCAAGGCCATCCCCTTCTG 

CTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGGGCCACAGGAGCCAGGAGATGGAT 

CTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCCAGAGCTGATCCAGGTATCCA 

AAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGCGGATGTCAGCCTGGGAC 

TTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCCTGGGAGATCCAAAC 

TGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAACGGCGATTCCG 

AATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGCCGGTCCAC 

TGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACACACACG 

AATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAATGT 

AGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCG 

GGGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCT 

TCAAGACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGA 

AAATGAGAAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACG 

GCTTCTACTCGCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTG 

AAGCGGGTCTGCGACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACAT 

TGGCTTCGTCATCGACGGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGT 

TTGTGACCAACCTCACCAAAGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTG 

CAGTACACCTACGAACAGCGGCTGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACAT 

CCTCAACGCCATCAAGAGGGTGGGCTACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCA 

ACTTCGCCCTGGAGCAGCTCTTCAAGAAGTCCAAGCCCAACAAGAGGAAGTTAATGATCCTC 

ATCACCGACGGGAGGTCCTACGACGACGTCCGGATCCCAGCCATGGCTGCCCATCTGAAGGG 

AGTGATCACCTATGCGATAGGCGTTGCCTGGGCTGCCCAAGAGGAGCTAGAAGTCATTGCCA 

CTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGAGTTTGACAACCTCCATCAGTATGTC 

CCCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCACAGCCTCGGAACTGAATTCAGAG 

CAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTTGGACCACCCCACCGCTTAA 

TGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAACAAATGTCTTGTTATTA 

TTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGATGATCACAAACGT 

ATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATTTTACATTTTGACAATT 

GTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAGAGCTTTT 

GTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCATTTTT 

GTCATGACAATGTAGGAATTGCTGAATTAAATGTTTAGAAGGATGAAAAATAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 113 

MRTVVLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKC 
PAGCQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSL 
SLPRWRESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQ 
LLAVTVAVATPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQ 
RQDPSGAAFQKPVGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFR 
IQKQLLADVAQALDIGPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNV 
GRAI SFVTKNFFSKANGNRSGAPNWWMVDGWPTDKVEEASRLARESG IN I FF I T I EGAAE 
NEKQYWEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADI 
GFVIDGSSSVGTGNFRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDI 
LNAI KRVG YWSGGTSTGAAI NF ALEQLFKKS KPNKRKLM I L I TDGR S YDDVR I P AMAAHLKG 
VITYAIGVAWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQNICTEFNSQPRN 
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FIGURE 114 



CAGGATGAACTGGTTGCAGTGGCTGCTGCTGCTGCGGGGGCGCTGAGAGGACACGAGCTCTA 
TGCCTTTCCGGCTGCTCATCCCGCTCGGCCTCCTGTGCGCGCTGCTGCCTCAGCACCATGGT 
GCGCCAGGTCCCGACGGCTCCGCGCCAGATCCCGCCCACTACAGTTTTTCTCTGACTCTAAT 
TGATGCACTGGACACCTTGCTGATTTTGGGGAATGTCTCAGAATTCCAAAGAGTGGTTGAAG 
TGCTCCAGGACAGCGTGGACTTTGATATTGATGTGAACGCCTCTGTGTTTGAAACAAACATT 
CGAGTGGTAGGAGGACTCCTGTCTGCTCATCTGCTCTCCAAGAAGGCTGGGGTGGAAGTAGA 
GGCTGGATGGCCCTGTTCCGGGCCTCTCCTGAGAATGGCTGAGGAGGCGGCCCGAAAACTCC 
TCCGAGCCTTTCAGACCCCCACTGGCATGCCATATGGAACAGTGAACTTACTTCATGGCGTG 
AACCCAGGAGAGACCCCTGTCACCTGTACGGCAGGGATTGGGACCTTCATTGTTGAATTTGC 
CACCCTGAGCAGCCTCACTGGTGACCCGGTGTTCGAAGATGTGGCCAGAGTGGCTTTGATGC 
GCCTCTGGGAGAGCCGGTCAGATATCGGGCTGGTCGGCAACCACATTGATGTGCTCACTGGC 
AAGTGGGTGGCCCAGGACGCAGGCATCGGGGCTGGCGTGGACTCCTACTTTGAGTACTTGGT 
GAAAGGAGCCATCCTGCTTCAGGATAAGAAGCTCATGGCCATGTTCCTAGAGTATAACAAAG 
CCATCCGGAACTACACCCGCTTCGATGACTGGTACCTGTGGGTTCAGATGTACAAGGGGACT 
GTGTCCATGCCAGTCTTCCAGTCCTTGGAGGCCTACTGGCCTGGTCTTCAGAGCCTCATTGG 
AGACATTGACAATGCCATGAGGACCTTCCTCAACTACTACACTGTATGGAAGCAGTTTGGGG 
GGCTCCCGGAATTCTACAACATTCCTCAGGGATACACAGTGGAGAAGCGAGAGGGCTACCCA 
CTTCGGCCAGAACTTATTGAAAGCGCAATGTACCTCTACCGTGCCACGGGGGATCCCACCCT 
CCTAGAACTCGGAAGAGATGCTGTGGAATCCATTGAAAAAATCAGCAAGGTGGAGTGCGGAT 
TTGCAACAATCAAAGATCTGCGAGACCACAAGCTGGACAACCGCATGGAGTCGTTCTTCCTG 
GCCGAGACTGTGAAATACCTCTACCTCCTGTTTGACCCAACCAACTTCATCCACAACAATGG 
GTCCACCTTCGACGCGGTGATCACCCCCTATGGGGAGTGCATCCTGGGGGCTGGGGGGTACA 
TCTTCAACACAGAAGCTCACCCCATCGACCTTGCCGCCCTGCACTGCTGCCAGAGGCTGAAG 
GAAGAGCAGTGGGAGGTGGAGGACTTGATGAGGGAATTCTACTCTCTCAAACGGAGCAGGTC 
GAAATTTCAGAAAAACACTGTTAGTTCGGGGCCATGGGAACCTCCAGCAAGGCCAGGAACAC 
TCTTCTCACCAGAAAACCATGACCAGGCAAGGGAGAGGAAGCCTGCCAAACAGAAGGTCCCA 
CTTCTCAGCTGCCCCAGTCAGCCCTTCACCTCCAAGTTGGCATTACTGGGACAGGTTTTCCT 
AGACTCCTC ATAA CCACTGGATAATTTTTTTATTTTTATTTTTTTGAGGCTAAACTATAATA 
AATTGCTTTTGGCTATCATAAAA 



o 

WO 99/63088 . PCT/US99/12252 

FIGURE 115 

MPFRLLIPLGLLCALLPQHHGAPGPDGSAPDPAHYSFSLTLIDALDTLLILGNVSEFQRWE 
VLQDSVDFDIDVNASVFETNIRWGGLLSAHLLSKKAGVEVEAGWPCSGPLLRMAEEAARKL 
LPAFQTPTGMPYGTVNLLHG VNPGET P VTCTAG I GT F I VE F ATLS S LTGDPVFED VARVALM 
RLWESRSDIGLVGNHI DVLTGKWVAQDAGI GAGVDS YFEYLVKGAI LLQDKKLMAMFLEYNK 
AI RNYTRFDDWYLWVQMYKGTVSM PVFQSLEAYW PGLQ SL I GD IDNAMRTFLNYYTVWKQFG 
GLPEFYNIPQGYTVEKREGYPLRPELIESAMYLYRATGDPTLLELGRDAVESIEKISKVECG 

FAT I KDLRDHKLDNRMES FFLAETVKYL YLLFDPTNFI HNNGSTFDAVITPYGECI LGAGGY 
IFOTEAHPIDLAALHCCQRLKEEQVfEVEDLMREFYSLKRSRSKFQKNTVSSGPWEPPARPGT 
LFSPENHDQARERKPAKQKVPLLSCPSQPFTSKLALLGQVFLDSS 
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AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 
GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 
ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 
GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 
. AATCCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 
ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 
TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 
AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 
GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 
ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 
GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 
TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 
CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 
GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA 
CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 
GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 
GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 
TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 
GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 
CTCATAGGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 
ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 
GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 
TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 
CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 
TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 
TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 
TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 
GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG 
TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 
AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 
CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 
GTGCAATGCGACGAGAATGCAGAAGTC^GTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 
TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 
AAAAAAAAAA 
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MQTFTMVLEEIVTTSLFMWFFYALIPC^ 

TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW 

SILKHPFNRNSTILTRPGMEITKDGFHLVIELEDLGPQ 

GIPVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQ 

WPLFWKMGRLLQ YS CC PWVLPDTLK I TNS PQKL I S CRREEVDACATAVMS PEELLRAW I S 

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation sites. 

amino acids 40-43 and 134-137 

Tissue factor proteins homology. 

amino acids 92-119 

Integrins alpha chain protein homology. 

amino acids 232-262 
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TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCCAC 
ATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 
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FIGURE 119 

CGGACGCGTGGGCCGCC^CCTCCGGAACAAGCCATGGTGGCGGCGACGGTGGCAGCGGCGTG 
GCTGCTCCTGTGGGCTGCGGCCTGCGCGCAGCAGGAGCAGGACTTCTACGACTTCAAGGCGG 
TCAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGTCCCTGGTGGTG 
AATGTGGCCAGCGAGTGCGGCTTCACAGACCAGCACTACCGAGCCCTGCAGCAGCTGCAGCG 
AGACCTGGGCCCCCACCACTTTAACGTGCTCGCCTTCCCCTGCAACCAGTTTGGCCAACAGG 
AGCCTGACAGCAACAAGGAGATTGAGAGCTTTGCCCGCCGCACCTACAGTGTCTCATTCCCC 
ATGTTTAGCAAGATTGCAGTCACCGGTACTGGTGCCCATCCTGCCTTCAAGTACCTGGCCCA 
GACTTCTGGGAAGGAGCCCACCTGGAACTTCTGGAAGTACCTAGTAGCCCCAGATGGAAAGG 
TGGTAGGGGCTTGGGACCCAACTGTGTCAGTGGAGGAGGTCAGACCCCAGATCACAGCGCTC 
GTGAGGAAGCTCATCCTACTGAAGCGAGAAGACTTATAACCACCGCGTCTCCTCCTCCACCA 
CCTCATCCCGCCCACCTGTGTGGGGCTGACCAATGCAAACTCAAATGGTGCTTCAAAGGGAG 
AGACCCACTGACTCTCCTTCCTTTACTCTTATGCCATTGGTCCCATCATTCTTGTGGGGGAA 
AAATTCTAGTATTTTGATTATTTGAATCTTACAGCAACAAATAGGAACTCCTGGCCAATGAG 
AGCTCTTGACCAGTGAATCACCAGCCGATACGAACGTCTTGCCAACAAAAATGTGTGGCAAA 
TAGAAGTATATCAAGCAATAATCTCCCACCCAAGGCTTCTGTAAACTGGGACCAATGATTAC 
CTCATAGGGCTGTTGTGAGGATTAGGATGAAATACCTGTGAAAGTGCCTAGGCAGTGCCAGC 
CAAATAGGAGGCATTCAATGAACATTTTTTGCATATAAACCAAAAAATAACTTGTTATCAAT 
AAAAACTTGCATCCAACATGAATTTCCAGCCGATGATAATCCAGGCCAAAGGTTTAGTTGTT 
GTTATTTCCTCTGTATTATTTTCTTCATTACAAAAGAAATGCAAGTTCATTGTAACAATCCA 
AACAATACCTCACGATATAAAATAAAAATGAAAGTATCCTCCTCAAAAA 
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FIGURE 120 

WAATVAAAWLLLWAAACAQQEQDFYDFKAWIRGKLVSLEKYRGSVSLVVNVASECGFTDQ 
HYR^QQLQRDLGPHHFNVLAFPCNQFGQQEPDSl^EIESFARRTYSVSFPMFSKIAVTGTG 
AHPAFKYLAQTSGKE PTWNFWKYLVAPDGKWGAWDPTVS VEEVRPQ ITALVRKL I LLKREDL 
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FIGURE 121 

CGGACGCGTGGGCGGGCCGGGACGCAGGGCAAAGCGAGCCATGGCTGTCTACGTCGGGATGC 
TGCGCCTGGGGAGGCTGTGCGCCGGGAGCTCGGGGGTGCTGGGGGCCCGGGCCGCCCTCTCT 
CGGAGTTGGCAGGAAGCCAGGTTGCAGGGTGTCCGCTTCCTCAGTTCCAGAGAGGTGGATCG 
CATGGTCTCCACGCCCATCGGAGGCCTCAGCTACGTTCAGGGGTGCACCAAAAAGCATCTTA 
ACAGCAAGACTGTGGGCCAGTGCCTGGAGACCACAGCACAGAGGGTCCCAGAACGAGAGGCC 
TTGGTCGTCCTCCATGAAGACGTCAGGTTGACCTTTGCCCAACTCAAGGAGGAGGTGGACAA 
AGCTGCTTCTGGCCTCCTGAGCATTGGCCTCTGCAAAGGTGACCGGCTGGGCATGTGGGGAC 
CTAACTCCTATGCATGGGTGCTCATGCAGTTGGCCACCGCCCAGGCGGGCATCATTCTGGTG 
TCTGTGAACCCAGCCTACCAGGCTATGGAACTGGAGTATGTCCTCAAGAAGGTGGGCTGCAA 
GGCCCTTGTGTTCCCCAAGCAATTCAAGACCCAGCAATACTACAACGTCCTGAAGCAGATCT 
GTCCAGAAGTGGAGAATGCCCAGCCAGGGGCCTTGAAGAGTCAGAGGCTCCCAGATCTGACC 
ACAGTCATCTCGGTGGATGCCCCTTTGCCGGGGACCCTGCTCCTGGATGAAGTGGTGGCGGC 
TGGCAGCACACGGCAGCATCTGGACCAGCTCCAATACAACCAGCAGTTCCTGTCCTGCCATG 
ACCCCATCAACATCCAGTTCACCTCGGGGACAACAGGCAGCCCCAAGGGGGCCACCCTCTCC 
CACTACAACATTGTCAACAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGAC 
ACCAGAGCAGTTGCGGATGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAG 
GCACAATGATGTGTCTGATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGC 
AAGAAGGCACTGGAGGCCATCAGCAGAGAGAGAGGCACCTTGCTGTATGGTACCCCCACGAT 
GTTCGTGGACATTCTGAACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAG 
GTGTCATTGCTGGGTCCCCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAAT 
ATGAAGGACCTGGTGGTTGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTT 
CCCTGAGGACACTGTGGAGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGG 
CCCGGATCATGAACATGGAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGC 
ATCCGAGGGTACTGCGTC ATGCTGGGCT ACTGGGGTGAGC CT CAGAAGACAGAGGAAGCAGT 
GGATCAGGACAAGTGGTATTGGACAGGAGATGTCGCCACAATGAATGAGCAGGGCTTCTGCA 
AGATCGTGGGCCGCTCTAAGGATATGATCATCCGGGGTGGTGAGAACATCTACCCCGCAGAG 
CTCGAGGACTTCTTTCACACACACCCGAAGGTGCAGGAAGTGCAGGTGGTGGGAGTGAAGGA 
CGATCGGATGGGGGAAGAGATTTGTGCCTGCATTCGGCTGAAGGACGGGGAGGAGACCACGG 
TGGAGGAGATAAAAGCTTTCTGCAAAGGGAAGATCTCTCACTTCAAGATTCCGAAGTACATC 
GTGTTTGTCACAAACTACCCCCTCACCATTTCAGGAAAGATCCAGAAATTCAAACTTCGAGA 
GCAGATGGAACGACATCTAAATCTGTGAATAAAGCAGCAGGCCTGTCCTGGCCGGTTGGCTT 
GACTCTCTCCTGTCAGAATGCAACCTGGCTTTATGCACCTAGATGTCCCCAGCACCCAGTTC 
TGAGCCAGGCACATCAAATGTCAAGGAATTGACTGAACGAACTAAGAGCTCCTGGATGGGTC 
CGGGAACTCGCCTGGGCACAAGGTGCCAAAAGGCAGGCAGCCTGCCCAGGCCCTCCCTCCTG 
TCCATCCCCCACATTCCCCTGTCTGTCCTTGTGATTTGGCATAAAGAGCTTCTGTTTTCTTT 
GAAAAAAAAAAAAAAAA 
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MAVYVGMLRLGRLCAGSSGVLGARAALSRSWQEARLQGVRFLSSREVDRMVSTPIGGLSYVQ 
GCTKKHIJtfSKTVGQCLETTAQRVPEREAL^ 

DRLGMWG PNS YAWVLMQLATAQAG 1 1 LVS VNPAYQAMELE YVLKKVGCKALVFPKQFKTQQY 

YIST^KQICPEVENAQPGALKSQRLPDLTWISVDAPLPGTLLLDEVVAAGSTRQHLDQI.QYN 

QQFLSCHDPINIQFTSGTTGSPKGATLSirasriVNNSNILGERLKLHEKTPEQLRMILPNPLY 

HCLGSVAGTMMCLMYGATLILASPIFNGKKAL'EAISRERGTFLYGTPTMFVDILNQPDFSSY 

DISTMCGGVIAGSPAPPELIRAIINKINMKDLWAYGTTENSPVTFAHFPEDTVEQKAESVG 

RIMPHTEARIMNMEAGTLAKLNTPGELCIRGYCVMLGYWGEPQKTEEAVDQDKW 

MNEQGFCKIVGRSKDMIIRGGENIYPAELEDFFHTHPKVQEVQWGVKDDRMGEEICACIRL 

KDGEETWEEIKAFCKGKISHFKIPKYIVFVTNYPLTISGKIQKFKLREQMERHLNL . 
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FIGURE 123 

CAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGACACCAGAGCAGTTGCGGA 
TGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAGGCACAATGATGTGTCTG 
ATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGCAAGAAGGCACTGGAGGC 
CATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGATGTTCGTGGACATTCTGA 
ACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAGGTGTCATTGCTGGGTCC 
CCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAATATGAAGGACCTGGTGGT 
TGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTTCCCTGAGGACACTGTGG 
AGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGGCGCGGATCATGAACATG 
GAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGCATCCGAGGGTACTGCGT 
CATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGTGGATCAGGACAAGTGGT 
ATTGGACAGGAGATGTCGCCAC 
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FIGURE 124 

GAGCAGGACGGAGCCATGGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGC 
AGGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCG 
TGCAGAAAGCAGATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAAGTGCGCGCCGGGC 
GTGGACGTCTGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACAATTCTCGCTGGC 
AGTGCGGGGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGCGGCCTGGATCTTCACGGGC 
TTCTGGCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCCAAGCTCAACCTC 
ACCTCGCGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCCAACGGCGTGGAGTG 
CTACAGCTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGCT 
GCTACAACGCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAACGTCACCTTGACGGCA 
GCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAGGATGAATTCTGCACTCGGGA 
TGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTCCTGTTGCCAGGGGTCCCGCTGTAACT 
CTGACCTCCGCAACAAGACCTACTTCTCCCCTCGAATCCCACCCCTTGTCCGGCTGCCCCCT 
CCAGAGCCCACGACTGTGGCCTCAACCACATCTGTCACCACTTCTACCTCGGCCCCAGTGAG 
ACCCACATCCACCACCAAACCCATGCCAGCGCCAACCAGTCAGACTCCGAGACAGGGAGTAG 
AACACGAGGCCTCCCGGGATGAGGAGCCCAGGTTGACTGGAGGCGCCGCTGGCCACCAGGAC 
CGCAGCAATTCAGGGCAGTATCCTGCAAAAGGGGGGCCCCAGCAGCCCCATAATAAAGGCTG 
TGTGGCTCCCACAGCTGGATTGGCAGCCCTTCTGTTGGCCGTGGCTGCTGGTGTCCTACTGT 
GAGCTTCTCCACCTGGAAATTTCCCTCTCACCTACTTCTCTGGCCCTGGGTACCCCTCTTCT 
CATCACTTCCTGTTCCCACCACTGGACTGGGCTGGCCCAGCCCCTGTTTTTCCAACATTCCC 
CAGTATCCCCAGCTTCTGCTGCGCTGGTTTGCGGCTTTGGGAAATAAAATACCGTTGTATAT 
ATTCTGCCAGGGGTGTTCTAGCTTTTTGAGGACAGCTCCTGTATCCTTCTCATCCTTGTCTC 
TCCGCTTGTCCTCTTGTGATGTTAGGACAGAGTGAGAGAAGTCAGCTGTCACGGGGAAGGTG 
AGAGAGAGGATGCTAAGCTTCCTACTCACTTTCTCCTAGCCAGCCTGGACTTTGGAGCGTGG 
GGTGGGTGGGACAATGGCTCCCCACTCTAAGCACTGCCTCCCCTACTCCCCGCATCTTTGGG 
GAATCGGTTCCCCATATGTCTTCCTTACTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTAC 
CCAATTCGCCCTATAGTGAGTCGTA 
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MDPARKAGAQAMIVTTAGWLIJjLLLRGGAQALECYSCVQKADDGCSPNKMKWKCAP 

EAVGAVETIHGQFSLAVRGCGSGLPGKNDRGLDLHGLLAFIQLQQCAQDRCNAKLNLTSRA^ 

DPAGNESAYPPNGVECYSCVGLSRE^CQGTSPPWSCYNASDHVYKGCFDGNVTLTAAN^ 

SLPVRGCYQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVRLPPPEPTT 

VASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAAGHQDRSNSG 

QYPAKGGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 



WO 99/63088 



PCT/US99/12252 



FIGURE 126 

CGGGACTCGGCGGGTCCTCCTGGGAGTCTCGGAGGGGACCGGCTGTGCAGACGCCATGGAGT 
TGGTGCTGGTCTTCCTCTGCAGCCTGCTGGCCCCCATGGTCCTGGCCAGTGCAGCTGAAAAG 
GAGAAGGAAATGGACCCTTTTCATTATGATTACCAGACCCTGAGGATTGGGGGACTGGTGTT 
CGCTGTGGTCCTCTTCTCGGTTGGGATCCTCCTTATCCTAAGTCGCAGGTGCAAGTGCAGTT 
TCAATCAGAAGCCCCGGGCCCCAGGAGATGAGGAAGCCCAGGTGGAGAACCTCATCACCGCC 
AATGCAACAGAGCCCCAGAAGCAGAGAACTGAAGTGCAGCCATCAGGTGGAAGCCTCTGGAA 
CCTGAGGCGGCTGCTTGAACCTTTGGATGCAAATGTCGATGCTTAAGAAAACCGGCCACTTC 
AGCAACAGCCCTTTCCCCAGGAGAAGCCAAGAACTTGTGTGTCCCCCACCCTATCCCCTCTA 
ACACCATTCCTCCACCTGATGATGCAACTAACACTTGCCTCCCCACTGCAGCCTGCGGTCCT 
GCCCACCTCCCGTGATGTGTGTGTGTGTGTGTGTGTGTGACTGTGTGTGTTTGCTAACTGTG 
GTCTTTGTGGCTACTTGTTTGTGGATGGTATTGTGTTTGTTAGTGAACTGTGGACTCGCTTT 
CCCAGGCAGGGGCTGAGCCACATGGCCATCTGCTCCTCCCTGCCCCCGTGGCCCTCCATCAC 
CTTCTGCTCCTAGGAGGCTGCTTGTTGCCCGAGACCAGCCCCCTCCCCTGATTTAGGGATGC 
GTAGGGTAAGAGCACGGGCAGTGGTCTTCAGTCGTCTTGGGACCTGGGAAGGTTTGCAGCAC 
TTTGTCATCATTCTTCATGGACTCCTTTCACTCCTTTAACAAAAACCTTGCTTCCTTATCCC 
ACCTGATCCCAGTCTGAAGGTCTCTTAGCAACTGGAGATACAAAGCAAGGAGCTGGTGAGCC 
CAGCGTTGACGTCAGGCAGGCTATGCCCTTCCGTGGTTAATTTCTTCCCAGGGGCTTCCACG 
AGGAGTCCCCATCTGCCCCGCCCCTTCACAGAGCGCCCGGGGATTCCAGGCCCAGGGCTTCT 
ACTCTGCCCCTGGGGAATGTGTCCCCTGCATATCTTCTCAGCAATAACTCCATGGGCTCTGG 
GACCCTACCCCTTCCAACCTTCCCTGCTTCTGAGACTTCAATCTACAGCCCAGCTCATCCAG 
ATGCAGACTACAGTCCCTGCAATTGGGTCTCTGGCAGGCAATAGTTGAAGGACTCCTGTTCC 
GTTGGGGCCAGCACACCGGGATGGATGGAGGGAGAGCAGAGGCCTTTGCTTCTCTGCCTACG 
TCCCCTTAGATGGGCAGCAGAGGCAACTCCCGCATCCTTTGCTCTGCCTGTCGGTGGTCAGA 
GCGGTGAGCGAGGTGGGTTGGAGACTCAGCAGGCTCCGTGCAGCCCTTGGGAACAGTGAGAG 
GTTGAAGGTCATAACGAGAGTGGGAACTCAACCCAGATCCCGCCCCTCCTGTCCTCTGTGTT 
CCCGCGGAAACCAACCAAACCGTGCGCTGTGACCCATTGCTGTTCTCTGTATCGTGATCTAT 
CCTCAACAACAACAGAAAAAAGGAATAAAATATCCTTTGTTTCCT 
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MELVLVFLCSLLAPMVLASAAEKEKEMDPFHYDYQTLRIGGLVFAVVLFSVGILL^ 
CSFNQKPRAPGDEEAQVENLITANATEPQKQRTEVQPSGGSLWNLRRLLEPLDANVDA 
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FIGURE 128 

AAACTTGACGCCATGAAGATCCCGGTCCTTCCTGCCGTGGTGCTCCTCTCCCTCCTGGTGCT 
CCACTCTGCCCAGGGAGCCACCCTGGGTGGTCCTGAGGAAGAAAGCACCATTGAGAATTATG 
CGTCACGACCCGAGGCCTTTAACACCCCGTTCCTGAACATCGACAAATTGCGATCTGCGTTT 
AAGGCTGATGAGTTCCTGAACTGGCACGCCCTCTTTGAGTCTATCAAAAGGAAACTTCCTTT 
CCTCAACTGGGATGCCTTTCCTAAGCTGAAAGGACTGAGGAGCGCAACTCCTGATGCCCAGT 
GACCATGACCTCCACTGGAAGAGGGGGCTAGCGTGAGCGCTGATTCTCAACCTACCATAACT 
CTTTCCTGCCTCAGGAACTCCAATAAAACATTTTCCATCCAAA 



WO 99/63088 . PCT/US99/I2252 

FIGURE 129 

MKIPVLPAWLLSLLVLHSAQGATLGGPEEESTIENYASRPEAFNTPFLNIDKLRSAFKADE 
FLNWHALFES IKRKLPFLNWDAFPKLKGLRSATPDAQ 
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FIGURE 130 

CAGTTCTGAAATCAATGGAGTTAATTTAGGGAATACAAACCAGCCATGGGGGTGGAGATTGC 

CTTTGCCTCAGTGATTCTCACCTGCCTCTCCCTTCTGGCAGCAGGAGTCTCCCAGGTTGTTC 

TTCTCCAGCCAGTTCCAACTCAGGAGACAGGTCCCAAGGCCATGGGAGATCTCTCCTGTGGC 

TTTGCCGGCCACTCATGAGAGTGTTTTTGTGTAAAGTATTTTTTAGAATACTGTTGACTTCT 

TCATGATTTAATAACCATCCTTTGCGAAGTTTTATGAGGCTTTAGGGGAATGTCAACCCTCA 

AATTTTTGTTATACTAGATGGCTTCCATTTACCCACCACTATTTTAAGGTCCCTTTATTTTT 

AGGTTCAAGGTTCATTTGACTTGAGAAAGTGCCCTTCTGCAGCTTCATTGATTTTGTTTATC 

TTCACTATTAATTGTAACGATTAAAAAAGAATAAGAGCACGCAGACCTCTAGGAGAATATTT 

TATCCCTGGGTGCCCCTGACACATTTATGTAGTGATCCCACAAATGTGATTGTTAATTTAAA 

TGTTATTCTAATATTAGTACATTCAGTTGTGATGTAATATGAATAACCAGAATCTATTTCTT 

AAAAGTTTTGAGTATATTTTTCAACTAGATATTTGTATAGAAAGACTGAATAGTGATG 
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FIGURE 131 

MGVEIAFASVILTCLSLLAAGVSQWLLQPVPTQETGPKAMGDLSCGFAGHS 
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FIGURE 132 

GGGGAATCTGCAGTAGGTCTGCCGGCGATGGAGTGGTGGGCTAGCTCGCCGCTTCGGCTCTG 
GCTGCTGTTGTTCCTCCTGCCCTCAGCGCAGGGCCGCCAGAAGGAGTCAGGTTCAAAATGGA 
AAGTATTTATTGACCAAATTAACAGGTCTTTGGAGAATTACGAACCATGTTCAAGTCAAAAC 
TGCAGCTGCTACCATGGTGTCATAGAAGAGGATCTAACTCCTTTCCGAGGAGGCATCTCCAG 
GAAGATGATGGCAGAGGTAGTCAGACGGAAGCTAGGGACCCACTATCAGATCACTAAGAACA 
GACTGTACCGGGAAAATGACTGCATGTTCCCCTCAAGGTGTAGTGGTGTTGAGCACTTTATT 
TTGGAAGTGATCGGGCGTCTCCCTGACATGGAGATGGTGATCAATGTACGAGATTATCCTCA 
GGTTCCTAAATGGATGGAGCCTGCCATCCCAGTCTTCTCCTTCAGTAAGACATCAGAGTACC 
ATGATATCATGTATCCTGCTTGGACATTTTGGGAAGGGGGACCTGCTGTTTGGCCAATTTAT 
CCTACAGGTCTTGGACGGTGGGACCTCTTCAGAGAAGATCTGGTAAGGTCAGCAGCACAGTG 
GCCATGGAAAAAGAAAAACTCTACAGCATATTTCCGAGGATCAAGGACAAGTCCAGAACGAG 
ATCCTCTCATTCTTCTGTCTCGGAAAAACCCAAAACTTGTTGATGCAGAATACACCAAAAAC 
CAGGCCTGGAAATCTATGAAAGATACCTTAGGAAAGCCAGCTGCTAAGGATGTCCATCTTGT 
GGATCACTGCAAATACAAGTATCTGTTTAATTTTCGAGGCGTAGCTGCAAGTTTCCGGTTTA 
AACACCTCTTCCTGTGTGGCTCACTTGTTTTCCATGTTGGTGATGAGTGGCTAGAATTCTTC 
TATCCACAGCTGAAGCCATGGGTTCACTATATCCCAGTCAAAACAGATCTCTCCAATGTCCA 
AGAGCTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAGAGATTGCTGAAAGGGGAA 
GCCAGTTTATTAGGAACCATTTGCAGATGGATGACATCACCTGTTACTGGGAGAACCTCTTG 
AGTGAATACTCTAAATTCCTGTCTTATAATGTAACGAGAAGGAAAGGTTATGATCAAATTAT 
TCCCAAAATGTTGAAAACTGAACTATAGTAGTCATCATAGGACCATAGTCCTCTTTGTGGCA 
ACAGATCTCAGATATCCTACGGTGAGAAGCTTACCATAAGCTTGGCTCCTATACCTTGAATA 
TCTGCTATCAAGCCAAATACCTGGTTTTCCTTATCATGCTGCACCCAGAGCAACTCTTGAGA 
AAGATTTAAAATGTGTCTAATACACTGATATGAAGCAGTTCAACTTTTTGGATGAATAAGGA 
CCAGAAATCGTGAGATGTGGATTTTGAACCCAACTCTACCTTTCATTTTCTTAAGACCAATC 
ACAGCTTGTGCCTCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAATTGACTGTGTCCA 
TGTGATGATGCCCTTTGTCCCATTATTTGGAGCAGAAAATTCGTCATTTGGAAGTAGTACAA 
CTCATTGCTGGAATTGTGAAATTATTCAAGGCGTGATCTCTGTCACTTTATTTTAATGTAGG 
AAACCCTATGGGGTTTATGAAAAATACTTGGGGATCATTCTCTGAATGGTCTAAGGAAGCGG 
TAGCCATGCCATGCAATGATGTAGGAGTTCTCTTTTGTAAAACCATAAACTCTGTTACTCAG 
GAGGTTTCTATAATGCCACATAGAAAGAGGCCAATTGCATGAGTAATTATTGCAATTGGATT 
TCAGGTTCCCTTTTTGTGCCTTCATGCCCTACTTCTTAATGCCTCTCTAAAGCCAAA 
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FIGURE 133 

MEVWASSPLRLV^LLFLLPSAQGRQKESGS 

EDLTPFRGGISRKMMAEWRRKLGTHYQITKNRLYRENDCMFPSRCSGVEHFILEVIGRLPD 
MEMVINVRDYPQVPKWME PA I P VFSFSKTSEYHD IMYPAWTFWEGGPAVWP I YPTGLGRWDL 
FREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDT 
K5KPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVH 
YIPVKTDLSNVQELLQFVKANDDVAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 
NVTRRKGYDQ 1 1 PKMLKTEL 
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FIGURE 134 

CACCCCTCCATTTCTCGCCATGGCCCCTGCACTGCTCCTGATCCCTGCTGCCCTCGCCTCTT 
TCATCCTGGCCTTTGGCACCGGAGTGGAGTTCGTGCGCTTTACCTCCCTTCGGCCACTTCTT 
GGAGGGATCCCGGAGTCTGGTGGTCCGGATGCCCGCCAGGGATGGCTGGCTGCCCTGCAGGA 
CCGCAGCATCCTTGCCCCCCTGGCATGGGATCTGGGGCTCCTGCTTCTATTTGTTGGGCAGC 
ACAGCCTCATGGCAGCTGAAAGAGTGAAGGCATGGACATCCCGGTACTTTGGGGTCCTTCAG 
AGGTCACTGT ATGTGG CCTG CACTGC C CTGGC CTTG CAG C TGGTGATG CGGTACTGGGAGCC 
CATACCCAAAGGCCCTGTGTTGTGGGAGGCTCGGGCTGAGCCATGGGCCACCTGGGTGCCGC 
TCCTCTGCTTTGTGCTCCATGTCATCTCCTGGCTCCTCATCTTTAGCATCCTTCTCGTCTTT 
GACTATGCTGAGCTCATGGGCCTCAAACAGGTATACTACCATGTGCTGGGGCTGGGCGAGCC 
TCTGGCCCTGAAGTCTCCCCGGGCTCTCAGACTCTTCTCCCACCTGCGCCACCCAGTGTGTG 
TGGAGCTGCTGACAGTGCTGTGGGTGGTGCCTACCCTGGGCACGGACCGTCTCCTCCTTGCT 
TTCCTCCTTACCCTCTACCTGGGCCTGGCTCACGGGCTTGATCAGCAAGACCTCCGCTACCT 
CCGGGCCCAGCTACAAAGAAAACTCCACCTGCTCTCTCGGCCCCAGGATGGGGAGGCAGAGT 
GAGGAGCTCACTCTGGTTACAAGCCCTGTTCTTCCTCTCCCACTGAATTCTAAATCCTTAAC 
ATCCAGGCCCTGGCTGCTTCATGCCAGAGGCCCAAATCCATGGACTGAAGGAGATGCCCCTT 
CTACTACTTGAGACTTTATTCTCTGGGTCCAGCTCCATACCCTAAATTCTGAGTTTCAGCCA 
CTGAACTCCAAGGTCCACTTCTCACCAGCAAGGAAGAGTGGGGTATGGAAGTCATCTGTCCC 
TTCACTGTTTAGAGCATGACACTCTCCCCCTCAACAGCCTCCTGAGAAGGAAAGGATCTGCC 
CTGACCACTCCCCTGGCACTGTTACTTGCCTCTGCGCCTCAGGGGTCCCCTTCTGCACCGCT 
GGCTTCCACTCCAAGAAGGTGGACCAGGGTCTGCAAGTTCAACGGTCATAGCTGTCCCTCCA 
GGCCCCAACCTTGCCTCACCACTCCCGGCCCTAGTCTCTGCACCTCCTTAGGCCCTGCCTCT 
GGGCTCAGACCCCAACCTAGTCAAGGGGATTCTCCTGCTCTTAACTCGATGACTTGGGGCTC 
CCTGCTCTCCCGAGGAAGATGCTCTGCAGGAAAATAAAAGTCAGCCTTTTTCTAAAAAAAA 
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FIGURE 135 

MAPALLL I PAALAS FI LAFGTGVE FVRFTS LR PLLGGI PE SGGPDARQGWLAALQDRS I LAP 
lAWDLGLIiLFVGQHSimAERVKAWTSRYFGVLQRSLYVACTALALQLVMRYVfEPIPKGPV 
LWEARAE PWATWVPLLCFVLHV I S WLL I FS I LLVFD YAELMGLKQVYYHVLGLGE PLALKS P 

RALRLFSHLRHPVCVELLTVLWWPTLGTDRLLLAFLLTLYLGLAHGLDQQDLRYLRAQLQR 
KLHLLSRPQDGEAE 
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FIGURE 136 



PCT/US99/12252 



CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGA 

AGAAATTGCCAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGT 

TTTAATTAAGCATGGAATACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATT 

CCACAGTTTTCTTAGCTCCCTGGACCCGGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTA 

TCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTAAAGAACCTTCGGCTCGCGTGCTTCTG 

AGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTAGGATGTCACTGAGATCC 

CTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTACCTCAGCCT 

TCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTT 

ACAGACAAGACTTTCACTTCACACTTCGAGAGCATTGAAACTGCTCTCATCAAAATCC^ 

CTGGTCATTCTGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTAC 

TTGGGGTGAAAAAAAGTCTTGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAG 

AGGCTGAAAAGGAAGACAAAATGTTGGCATTGTCCTTAGAGGATGAACACCTTCTTTATGGT 

GACATAATCCGACAAGATTTTTTAGACACATATAATAACCTGACCTTGAAAACCATTATGGC 

ATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATGAAGACAGACACTGATG 

TTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGAGAAGTTT 

TTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATAT 

TTCTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAA 

TGTCCAGAGATTTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTT 

GAAGATGTTTATGTCGGGATCTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGA 

CACAAATCTTTTCTTTCTATATAGAATCCATTTGGATGTCTGTCAACTGAGACGTGTGATTG 

CAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTTTGGCAGGTCATGCTAAGGAACACC 

ACATGCCATTATT^ACTTCACATTCTACAAAAAGCCTAGAAGGACAGGATACCTTGTGGAAA 

GTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTTACACTG 

AACTGAAACTCATGAT^AAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTC 

AGGCCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAA 

GAAATTAATAGGACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGG 

TGTTACTGAGTTATAAGCTCACTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTG 

AACAATGTAGTCACTTGAAGGTTTTGTGTATATCTTATGTGGATTACCAATTTAAAAATATA 

TGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATACTGAACAAAATTTTACCTGTTTT 

TGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATTATTATTTAAAATTA 

CTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGAT7VAAAAGGATAGTGAAT 

CATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCAC 

TCCATTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAAT 

ATTTTACTGTGGTAATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 137 

MASALlAm^PSRMSLRSLKWSLL^^ 

FHFTLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQE^ 
EDKMLALSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVM 
TGNLVKYLLNIJraSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRD 
LVPRIYEMMGHVKPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHG 
FSSKE 1 1 TFWQVMLRNTTCHY 
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FIGURE 138 



CCTCTGTCCACTGCTTTCGTGAAGACAAGATCAAGTTCACAATTGTCTTTGCTGGACTTCTT 
GGAGTCTTTCTAGCTCCTGCCCTAGCTAACTATAATATCAACGTCAATGATGACAACAACAA 
TGCTGGAAGTGGGCAGCAGTCAGTGAGTGTCAACAATGAACACAATGTGGCCAATGTTGACA 
ATAACAACGGATGGGACTCCTGGAATTCCATCTGGGATTATGGAAATGGCTTTGCTGCAACC 
AGACTCTTTCAAAAGAAGACATGCATTGTGCACAAAATGAACAAGGAAGTCATGCCCTCCAT 
TCAATCCCTTGATGCACTGGTCAAGGAAAAGAAGCTTCAGGGTAAGGGACCAGGAGGACCAC 
CTCCCAAGGGCCTGATGTACTCAGTCAACCCAAACAAAGTCGATGACCTGAGCAAGTTCGGA 
AAAAACATTGCAAACATGTGTCGTGGGATTCCAACATACATGGCTGAGGAGATGCAAGAGGC 
AAGCCTGTTTTTTTACTCAGGAACGTGCTACACGACCAGTGTACTATGGATTGTGGACATTT 
CCTTCTGTGGAGACACGGTGGAGAACTAAACAATTTTTTAAAGCCACTATGGATTTAGTCAT 
CTGAATATGCTGTGCAGAAAAAATATGGGCTCCAGTGGTTTTTACCATGTCATTCTGAAATT 
TTTCTCTACTAGTTATGTTTGATTTCTTTAAGTTTCAATAAAATCATTTAGCATTGAAAAAAA 
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FIGURE 13g 

MKFTIVFAGLLGVFLAPAIANYNINVOTDNNN^ 

IWDYGNGFAATRLFQKKTCIVHKMNKEVMPSIQSLDALVKEKKLQGKGPGGPPPKGLMYSVN 
PNKVDDLS KFGKN I ANMCRG I PTYMAEEMQEASLFF YSGTCYTTS VLW I VD I SFCGDTVEN 
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FIGURE 140 

CATTTCTGAAACTAATCGTGTCAGAATTGACTTTGAAAAGCATTGCTTTTTACAGAAGTATA 
TTAACTTTTTAGGAGTAATTTCTAGTTTGGATTGTAATATGAAATAATTTAAAAGGGCTTCG 
CTCATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTCTTATTGCTTACTGATTTTTT 
TGAGTTAAGAGTTGTTATATGCTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAA 
GAATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATG 
GAAGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGT 
TTGTTCGATTTCAACCAGAGACTATAGCATGTGCTTGCATCTACCTTGCAGCTAGAGCACTT 
CAGATTCCGTTGCCAACTCGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAAT 
CCAGGAAATCTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTAC 
TGGAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAGCCAAATTAAAAGCT^AGGGA 
TTGAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCCAGCCTCCAAGCC 
AT CATC AC C AAG AG AAGTAAAAGCTGAAG AG AAATC AC C AAT CT C C ATT AATGTGAAG AC AG 
TCAAAAAAGAACCTGAGGATAGACAACAGGCTTCCAAAAGCCCTTACAATGGTGTAAGAAAA 
GACAGCAAGAGAAGTAGAAATAGCAGAAGTGCAAGTCGATCGAGGTCAAGAACACGATCACG 
TTCTAGATCACATACTCCAAGAAGACACTATAATAATAGGCGGAGTCGATCTGGAACATACA 
GCTCGAGATCAAGAAGCAGGTCCCGCAGTCACAGTGAAAGCCCTCGAAGACATCATAATCAT 
GGTTCTCCTCACCTTAAGGCCAAGCATACCAGAGATGATTTAAAAAGTTCAAACAGACATGG 
TCATAAAAGGAAAAAATCTCGTTCTCGATCTCAGAGCAAGTCTCGGGATCACTCAGATGCAG 
CCAAGAAACACAGGCATGAAAGGGGACATCATAGGGACAGGCGTGAACGATCTCGCTCCTTT 
GAGAGGTCCCATAAAAGCAAGCACCATGGTGGCAGTCGCTCAGGACATGGCAGGCACAGGCG 
CTGACTTTCTCTTCCTTTGAGCCTGCATCAGTTCTTGGTTTTGCCTATCTACAGTGTGATGT 
ATGGACTCAATCAAAAACATTAAACGCAAACTGATTAGGATTTGATTTCTTGAAACCCTCTA 
GGTCTCTAGAACACTGAGGACAGTTTCTTTTGAAAAGAACTATGTTAATTTTTTTGCACATT 
AAAATGCCCTAGCAGTATCTAATTAAAAACCATGGTCAGGTTCAATTGTACTTTATTATAGT 
TGTGTATTGTTTATTGCTATAAGAACTGGAGCGTGAATTCTGTAAAAATGTATCTTATTTTT 
ATACAGATT^AAATTGCAGACACTGTTCTATTTAAGTGGTTATTTGTTTAAATGATGGTGAAT 
ACTTTCTTAACACTGGTTTGTCTGCATGTGTAAAGATTTTTACAAGGAAATAAAATACAAAT 
CTTGTTTTTTCTAAAAAAAAAAAAAAAAAAGT 
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FIGURE 141 



PCT/US99/I2252 



MNDSLRTNVFVRFQPET I ACACI YLAARALQI PL PTRPHWFLLFGTTEEE IQE I C I ETLRLY 
TRKKPNYELLEKE VEKRKVALQEAKLKAKGLNPDGTPALS TLGG FS PAS KPS S PRE VKAEEK 
SPISINVKTVKKEPEDRQQASKSPYNGVRKDSKRSRNSRSASRSRSRTRSRSRSHTPRRHYN 
NRRSRSGTYSSRSRSRSRSHSESPRRHHNHGSPHLKAKHTRDDLKSSNRHGHKRKKSRSRSQ 
SKSRDHSDAAKKHRHERGHHRDRRERSRSFERSHKSKHHGGSRSGHGRHRR 
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FIGURE 142 

TGGGGATAAAGGAAAAATGGTCAGGTATTAATGGCTTAAAGATTATTGGAAGGGGTTTATCA 
TTTTTTGAANNTATTCGGGTCANAATTGNCTTTGAAAAGCATTGCTTTTTACAGAAATATAT 
TANCTTTTTAGAGTAATTTCTAGTTTGGATTGTAATATGAAATTATTTAAAAGGGCTTCGCT 
CATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTNTTATTGCTTACTGATTTTTTTG 
AGTTAAGAGTTGTTATATGNTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAAGA 
ATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATGCA 
AGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGTTT 
GTTCGATTTCAACCAGAGANTATAGCATGTGCTTGCATCTACCTTGCAGNTAGAGCACTTCA 
GATTCCGTTGCCAACTNGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAATCC 
AGGAAATNTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTACTG 
G AAAAAGAAGTAGAAAAAAG AAAAGTAGCCTT AC AAGAAG C CNAATTAAAAG C AAAGGGATT 
GAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCC 
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FIGURE 143 

GGCACGAGGCCTCGTGCCAAGCTTGGCACGAGGGTGCACCGCGTTCTCGCACGCGTC ATG GC 
GGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCACCCTCATGCACAGGCTGGCGC 
CACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTTTGTTCCGATACAAGCACCCG 
TCTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCCCAGAGGCAGGAAAGAGCGGTG 
GGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCGGAGATGCCCCGTTCCAGCTGG 
AGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGCTTCTTCCTGGAGTACCAGTGG 
TTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTTCACAGAGGCCTACTACTACAT 
GCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGTGCCTGCTCACGGTGACCTTCT 
CCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGCGCCGAGGAGGGGGGTGAGCGC 
TCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGCCATGCTGGTGCAAGTGGTGCG 
GGAGGAGACCCTCGAGCTGGGCCTGGAGCCTGGTCTGGCCAGCATGACCCAGAACTTAGAGC 
CACTTCTGAAGAAGCAGGGCTGGGACTGGGCGCTTCCTGTGGCCAAGCTGGCTATCCGCGTG 
GGACTGGCAGTGGTGGGCTCTGTGCTGGGTGCCTTCCTCACCTTCCCAGGCCTGCGGCTGGC 
CCAGACCCACCGGGACGCACTGACCATGTCGGAGGACAGACCCATGCTGCAGTTCCTCCTGC 
ACACCAGCTTCCTGTCTCCCCTGTTCATCCTGTGGCTCTGGACAAAGCCCATTGCACGGGAC 
TTCCTGCACCAGCCGCCGTTTGGGGAGACGCGTTTCTCCCTGCTGTCCGATTCTGCCTTCGA 
CTCTGGGCGCCTCTGGTTGCTGGTGGTGCTGTGCCTGCTGCGGCTGGCGGTGACCCGGCCCC 
ACCTGCAGGCCTACCTGTGCCTGGCCAAGGCCCGGGTGGAGCAGCTGCGAAGGGAGGCTGGC 
CGCATCGAAGCCCGTGAAATCCAGCAGAGGGTGGTCCGAGTCTACTGCTATGTGACCGTGGT 
GAGCTTGCAGTACCTGACGCCGCTCATCCTCACCCTCAACTGCACACTTCTGCTCAAGACGC 
TGGGAGGCTATTCCTGGGGCCTGGGCCCAGCTCCTCTACTATCCCCCGACCCATCCTCAGCC 
AGCGCTGCCCCCATCGGCTCTGGGGAGGACGAAGTCCAGCAGACTGCAGCGCGGATTGCCGG 
GGCCCTGGGTGGCCTGCTTACTCCCCTCTTCCTCCGTGGCGTCCTGGCCTACCTCATCTGGT 
GGACGGCTGCCTGCCAGCTGCTCGCCAGCCTTTTCGGCCTCTACTTCCACCAGCACTTGGCA 
GGCTCCTAGCTGCCTGCAGACCCTCCTGGGGCCCTGAGGTCTGTTCCTGGGGCAGCGGGACA 
CTAGCCTGCCCCCTCTGTTTGCGCCCCCGTGTCCCCAGCTGCAAGGTGGGGCCGGACTCCCC 
GGCGTTCCCTTCACCACAGTGCCTGACCCGCGGCCCCCCTTGGACGCCGAGTTTCTGCCTCA 
GAACTGTCTCTCCTGGGCCCAGCAGCATGAGGGTCCCGAGGCCATTGTCTCCGAAGCGTATG 
TGCCAGGTTTGAGTGGCGAGGGTGATGCTGGCTGCTCTTCTGAACAAATAAAGGAGCATGCC 
GATTTTTAA 



WO 99/63088 ^V^f/lA 0 PCT/US99/12252 

FTGTJRE 144 

MAVLGVQLVVTLLTATLMHRLAPHCSFARWLLCNGSLFRYKHPSEEELRALAGKPRPRGRKE 
RWANGLSEEKPLSVPRDAPFQLETCPLTTVDALVLRFFLEYQWFVDFAVYSGGVYLFTEAYY 

YMLGPAKETNI AVFWCLLTVTFS I KMFLTVTRLYFSAEEGGERS VCLTFAFLFLLLAMLVQV 
VREETLELGLEPGLASMTQNLEPLLKKQGWDWALPVAKLAIRVGLAWGSVLGAFLTFPGLR 
LAQTHRDALTMSEDRPMLQFLLHTSFLSPLFILWLWTKPIARDFLHQPPFGETRFSLLSDSA 
FDSGRLWLLWLCLLRLAVTRPHLQAYLCLAKARVEQLRREAGRIEAREIQQRWRVYCYVT 
WSLQYLTPLILTLNCTLLLKTLGGYSWGLGPAPLLSPDPSSASAAPIGSGEDEVQQTAARI 
AGALGGLLTPLFLRGVLAYLIWWTAACQLLASLFGLYFHQHLAGS 



WO 99/63088 __AH£>/l>VO PCT/US99/122S2 

FIGURE 145 

CGTTNGCACGCGTCAATGGCGGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCAC 
CCTCATGCAC^GGCTGGCGCCACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTT 
TGTTCCGATACAAGCACCCGTNTTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCC 
CAGAGGCAGGAAAGAGCGGTGGGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCC 
GAGATGCCCCGTTCCAGCTGGAGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGC 
TTCTTCCTGGAGTACCAGTGGTTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTT 
CACAGAGGCCTACTACTACATGCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGT 
GCCTGCTCACAGTGACCTTCTCCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGC 
GCCGAGGAGGGGGGTGAGCGCTCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGC 
CATGCTGGTGCAAGCG 
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FIGURE 146 

GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTA 
TTAACGTGGCTTAATCTGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGC 
ATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGGTTTGGGCCCTTGTAGCTGACAGAAGGTGGC 
CAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGCTGGTCTTGCCTTGG 
CTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTG 
TAAAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTC 
CAGACGGCTGTGCGAGCCTCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACC 
ATCTCCTTAATGACAGACGAGCCTGGCCTAGACAACCCTGCCTACGTGTCCTCGGCAGAGGA 
CGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTAGGGCACGGCCCT 
TTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTT 
CGAAGGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGA 
AAACACCACTGCCCCTGAAGTCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTA 
CCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAAGCCTCTCTATTAGGCTGGTGGGAGGT 
AGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGGTGATCGCCAG 
AGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATG 
TCCCTCACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTG 
ATGCGTGAACAGAAGTTCCGCAGCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCG 
AGATGACAGCTTTGATGTGATTCTCAACAAAAGTAGCCCCGAGGAGCAGCTTGGAATAAAAC 
TGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCGGTGTGGCA 
TATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCG 
ATATGGCAGCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCG 
TCGTGTCCCGCCAGGTTCGGCAGCGGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGC 
AATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACACTCCCAAGCCCCTCCATCCTAC 
AATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCGGCATGA 
CCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAG 
CCCGGAGGAGTCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGA 
TGGGGTCGAACTGACAGAGGTCAGCCGGAGTGAGGCAGTGGCATTATTGAAAAGAACATCAT 
CCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGCCCCAGGAAGACTGCAGCAGC 
CCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCTGGGT 
CATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACA 
CAGCTGGAAGTCTGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCT 
TTTTTCATCAAATCCATTGTTGAAGGAACACCAGCATACAATGATGGAAGAATTAGATGTGG 
TGATATTCTTCTTGCTGTCAATGGTAGAAGTACATCAGGAATGATACATGCTTGCTTGGCAA 
GACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTTTT 
TTATAGAATCAATGATGGGTCAGAGGAAAACAGAAAAATCACAAATAGGCTAAGAAGTTGAA 
ACACTATATTTATCTTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGG 
AAAAGTATGATCATCTAATGAAAGCCAGTTACACCTCAGAAAATATGATTCCAAAAAAATTA 
AAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTACAACATTGTTTATATTT 
TTTCTATTC^TAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 
CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAAGTCTGCCAAGGGTACATTATGGCCA 
TTTTTAATTTACAGCTAAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAA 
ACAAGAATAAATATTTTTCAGAAGTTAAA 
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FIGURE 147 

MKALLLLVLPWLSPANYIDNVGJ^HFLYSELCKGASHYGLTKDRKRRSQDGCPDG 

APSPEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRS3STRTRARPFERSTIRSRS 

FKKINRALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDP 

SESLS I RLVGGSETPLVH 1 1 IQHI YRDGVI ARDGRLLPGD 1 1 LKVNGMD I SNVPHNYAVRLL 

RQPCQVLWLTVMREQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGV 

FIFNVLDGGVAYRHGQLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRS 

PDIFQEAGWNSNGSWSPGPGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHRE 

WDLP I YVI SVEPGGVI SRDGRI KTGD ILLNVDGVELTEVSRSEAVALLKRTSSS I VLKALEV 

KEYEPQEDCSSPAALDSNHNl^PPSDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIV 

GGYEEYNGNKPFFIKSIVEGTPAYlSnDGRIRCGDILI^VNGRSTSGMIHACLARLLKELKGRI 

TLTIVSWPGTFL 



WO 99/63088 _ . / _ PCTAJS99/12252 

FIGURE 148 

CCAAAGTGATCATTTGAAAAAGAGATATCCACATCTTCAAGCCCATATAAAGGATAGAAGCT 
GCACAGGGCAGCTTTACTTACTCCAGCACCTTCCTCTCCCAGGCAAATGGTGCTGACCATCT 
TTGGGATACAATCTCATGGATACGAGGTTTTTAACATCATCAGCCCAAGCAACAATGGTGGC 
AATGTTCAGGAGACAGTGACAATTGATAATGAAAAAAATACCGCCATCGTTAACATCCATGC 
AGGATCATGCTCTTCTACCACAATTTTTGACTATAAACATGGCTACATTGCATCCAGGGTGC 
TCTCCCGAAGAGCCTGCTTTATCCTGAAGATGGACCATCAGAACATCCCTCCTCTGAACAAT 
CTCCAATGGTACATCTATGAGAAACAGGCTCTGGACAACATGTTCTCCAACAAATACACCTG 
GGTCAAGTACAACCCTCTGGAGTCTCTGATCAAAGACGTGGATTGGTTCCTGCTTGGGTCAC 
CCATTGAGAAACTCTGCAAACATATCCCTTTGTATAAGGGGGAAGTGGTTGAAAACACACAT 
AATGTCGGTGCTGGAGGCTGTGCAAAGGCTGGGCTCCTGGGCATCTTGGGAATTTCAATCTG 
TGCAGACATTCATGTTTAGGATGATTAGCCCTCTTGTTTTATCTTTTCAAAGAAATACATCC 
TTGGTTTACACTCAAAAGTCAAATTAAATTCTTTCCCAATGCCCCAACTAATTTTGAGATTC 
AGTCAGAAAATATAAATGCTGTATTTATA 
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FIGURE 149 

MK I LVAFLWLT I FG I QSHGYEVFN IIS PSNNGGNVQETVT I DNEKNTAI VN I HAGSC S STT 
IFDYKHGYIASRVLSRRACFILKMDHQNIPPLNNLQWYIYEKQALDNMFSNKYTWVKYNPLE 
SLI KDVDWFLLGS P I EKLCKHI PLYKGEWENTHNVGAGGCAKAGLLG I LG I S I CAD IHV 
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FIGURE 150 
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GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAGGC 
ATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGGCACG 
GGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCCCCAAAG 
GCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCTATTCCCTC 
TGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGCCGGCCTCCTT 
CAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCTACTTCTGCCGGGCGTCCT 
CCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGGGAGCTGTGGTCCAAG 
CCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGGGCAGGCCCCAGGGTGGA 
GATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAACAGCCTGATCGGGAAGGATG 
GGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGCAGCCTGCCAACTTCTCCTTCCTGCCG 
AGCCAGACATCGGACTGGTTCTGGTGCCAGGCTGCAAACAACGCCAATGTCCAGCACAGCGC 
CCTCACAGTGGTGCCCCCAGGTGGTGACCAGAAGATGGAGGACTGGCAGGGTCCCCTGGAGA 
GCCCCATCCTTGCCTTGCCGCTCTACAGGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGG 
GGGTTCAGGATAGGGAATGGGGAGGTCAGAGGACGCAAAGCAGCAGCCATGTAGAATGAACC 
GTCCAGAGAGCCAAGCACGGCAGAGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGA 
GTTCATGCAAAATGAGTGTGTTTTAGCTGCTCTTGCCACAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 151 

MGLPGLFCLAVLAASSFSKAREE 

CGTKNIKVAKKVVKTHEPASFNIjNVTLKSSPDLLTYFCRASSTSGAHVD^ 
PVSELRANFTLQDRGAGPRVEMICQASSGSPPITNSLIGKDGQVHLQQRPCHRQPANFSFLP 
SQTSDWFWCQAANNANVQHSALTWPPGGDQKMEDWQGPLESPILALPLYRSTR^ 
GFRIGNGEVRGRKAAAM 
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FTGTJRE 152 
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GGTCCTTAATGGCAGCAGCCGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCCTG 

CTGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCACCGT 

CATCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATGAAAAGA 

CTTTTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTGGGGAAGAAA 

CTAAATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTGAGAGAGGTGGTGGACATACT 

TACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCTGC 

AGGCAAGGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTTCAGT 

TTCGATGGGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGTTCATCC 

TGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTCCTTCCATT 

ACTTCTCAATGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGATGGGCATGGACAGCACC 

CTGGAGCCAAGTGCAGGAGCACCACTCGCCATGTCCTCAGGCACAACCCAACTCAGGGCCAC 

AGCCACCACCCTCATCCTTTGCTGCCTCCTCATCATCCTCCCCTGCTTCATCCTCCCTGGCA 

TCTGAGGAGAGTCCTTTAGAGTGACAGGTTAAAGCTGATACCAAAAGGCTCCTGTGAGCACG 

GTCTTGATCAAACTCGCCCTTCTGTCTGGCCAGCTGCCCACGACCTACGGTGTATGTCCAGT 

GGCCTCCAGCAGATCATGATGACATCATGGACCCAATAGCTCATTCACTGCCTTGATTCCTT 

TTGCCAACAATTTTACCAGCAGTTATACCTAACATATTATGCAATTTTCTCTTGGTGCTACC 

TGATGGAATTCCTGCACTTAAAGTTCTGGCTGACTAAACAAGATATATCATTTTCTTTCTTC 

TCTTTTTGTTTGGAAAATCAAGTACTTCTTTGAATGATGATCTCTTTCTTGCAAATGATATT 

GTCAGTAAAATAATCACGTTAGACTTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAG 

AATTTTTAAATTATTTAATAAGAAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTAT 

TGTTCTGTACTGATATTTAAATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 
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FIGURE 153 

MAAAAATKILLCLPLLLLLSGWSRAGRADPHSLCYDITVIPKFRPGPRWCAVQGQVDEKTFL 
HYDCGNKTVTPVS PLGKKLNVTTAWKAQNPVLRE WD I LTEQLRD I QLENYTPKEPLTLQAR 
MSCEQKAEGHSSGSWQFS FDGQ I FLLFDSEKRMWTTVHPGARKMKEKWENDKWAMSFHYFS 
MGDC IGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLI LCCLLI I LPCF ILPG I 
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FIGURE 154 
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GGGAAAGCCATTTCGAAAACCCATCTATACAAACTATATATTTTCATTTCTGCTGCTAGCTG 

CCTTGGGCCTCACAATTTTCATTCTGTTTTCTGACTTTCAAGTTATATACCGTGGAATGGAG 

TTGATCCCAACCATAACATCGTGGAGGGTTTTAATTTTGGTGGTAGCCCTCACCCAATTCTG 

GTGTGGCTTTCTTTGCAGAGGATTCCACCTTCAAAATCATGAACTCTGGCTGTTGATCAAAA 

GAGAATTTGGATTCTACTCTAAAAGTCAATATAGGACTTGGCAAAAGAAGCTAGCAGAAGAC . 

TCAACCTGGCCTCCCATAAACAGGACAGATTATTCAGGTGATGGCAAAAATGGATTCTACAT 

CAACGGAGGCTATGAAAGCCATGAACAGATTCCAAAAAGAAAACTCAAATTGGGAGGCCAAC 

CCACAGAACAGCATTTCTGGGCCAGGCTGTAATCAGAATTGTCGTCGTACATGCTCAACAGC 

ATTGCTTTTTTCCCCAAAATTAACACATTGTGGAGAAGTGATGATACTCTCCCCTTACCTTT 

CCTCTCTCCATTCAAGCATTCAAAGTATATTTTCAATGAATTAAACCTTGCAGCAAGGGACC 

TTAGATAGGCTTATTCTGACTGTATGCTTTACCAATGAGAGAAAAAAATGCATTTCCTGTAT 

CATCCTTTTCAATAAACTGTATTCATTTTGAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 155 



MELI PTITSWRVLILWALTQFWCGFLCRGFHLQNHELWLLI KREFGFYSKSQYRTWQKKLA 
EDSTWPPINRTDYSGDGKNGFYINGGYESHEQIPKRKLKLGGQPTEQHFWARL 
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FIGURE 156 

GTTCTCCTTTCCGAGCCAAAATCCCAGGCGATGGTGAATTATGAACGTGCCACACCATSAAG 

CTCTTGTGGCAGGTAACTGTGCACCACCACACCTGGAATGCCATCCTGCT 

CCTCACGGCGCAAGTGTGGATTCTGTGTGCAGCCATCGCTGCTGCCGCCTCAGCCGGGCCCC 

AGAACTGCCCCTCCGTTTGCTCGTGCAGTAACCAGTTCAGCAAGGTGGTGTGCACGCGCCGG 

GGCCTCTCCGAGGTCCCGCAGGGTATTCCCTCGAACACCCGGTACCTCAACCTCATGGAGAA 

CAACATCCAGATGATCCAGGCCGACACCTTCCGCCACCTCCACCACCTGGAGGTCCTGCAGT 

TGGGCAGGAACTCCATCCGGCAGATTGAGGTGGGGGCCTTCAACGGCCTGGCCAGCCTCAAC 

ACCCTGGAGCTGTTCGACAACTGGCTGACAGTCATCCCTAGCGGGGCCTTTGAATACCTGTC 

CAAGCTGCGGGAGCTCTGGCTTCGCAACAACCCCATCGAAAGCATCCCCTCTTACGCCTTCA 

ACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCTCAAGAAGCTGGAGTATATCTCT 

GAGGGAGCTTTTGAGGGGCTGTTCAACCTCAAGTATCTGAACTTGGGCATGTGCAACATTAA 

AGACATGCCCAATCTCACCCCCCTGGTGGGGCTGGAGGAGCTGGAGATGTCAGGGAACCACT 

TCCCTGAGATCAGGCCTGGCTCCTTCCATGGCCTGAGCTCCCTCAAGAAGCTCTGGGTCATG 

AACTCACAGGTCAGCCTGATTGAGCGGAATGCTTTTGACGGGCTGGCTTCACTTGTGGAACT 

CAACTTGGCCCACAATAACCTCTCTTCTTTGCCCCATGACCTCTTTACCCCGCTGAGGTACC 

TGGTGGAGTTGCATCTACACCACAACCCTTGGAACTGTGATTGTGACATTCTGTGGCTAGCC 

TGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGCCGCTGTCATGCTCCCAT 

GCACATGCGAGGCCGCTACCTCGTGGAGGTGGACCAGGCCTCCTTCCAGTGCTCTGCCCCCT 

TCATCATGGACGCACCTCGAGACCTCAACATTTCTGAGGGTCGGATGGCAGAACTTAAGTGT 

CGGACTCCCCCTATGTCCTCCGTGAAGTGGTTGCTGCCCAATGGGACAGTGCTCAGCCACGC 

CTCCCGCCACCCAAGGATCTCTGTCCTCAACGACGGCACCTTGAACTTTTCCCACGTGCTGC 

TTTCAGACACTGGGGTGTACACATGCATGGTGACCAATGTTGCAGGCAACTCCAACGCCTCG 

GCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACTACAGCTTCTTCACCACAGT 

AACAGTGGAGACCACGGAGATCTCGCCTGAGGACACAACGCGAAAGTACAAGCCTGTTCCTA 

CCACGTCCACTGGTTACCAGCCGGCATATACCACCTCTACCACGGTGCTCATTCAGACTACC 

CGTGTGCCCAAGCAGGTGGCAGTACCCGCGACAGACACCACTGACAAGATGCAGACCAGCCT 

GGATGAAGTCATGAAGACCACCAAGATCATCATTGGCTGCTTTGTGGCAGTGACTCTGCTAG 

CTGCCGCCATGTTGATTGTCTTCTATAAACTTCGTAAGCGGCACCAGCAGCGGAGTACAGTC 

ACAGCCGCCCGGACTGTTGAGATAATCCAGGTGGACGAAGACATCCCAGCAGCAACATCCGC 

AGCAGCAACAGCAGCTCCGTCCGGTGTATCAGGTGAGGGGGCAGTAGTGCTGCCCACAATTC 

ATGACCATATTAACTACAACACCTAC^^CCAGCACATGGGGCCCACTGGACAGAAAACAGC 

CTGGGGAACTCTCTGCACCCCACAGTCACCACTATCTCTGAACCTTATATAATTCAGACCCA 

TACCAAGGACAAGGTACAGGAAACTCAAATATGACTCCCCTCCCCCAAAAAACTTATAAAAT 

GCAATAGAATGCACACAAAGACAGCAACTTTTGTACAGAGTGGGGAGAGACTTTTTCTTGTA 

TATGCTTATATATTAAGTCTATGGGCTGGTTAAAAAAAACAGATTATATTAAAATTTAAAGA 

CAAAAAGTCAAAACA 
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FTftlIRE 157 

MKLLWQVTVHHHTWNAILLPFVYLTAQVWILCAAIAAAASAGPQNCPSVCSCSNQFSKVVCT 
RRGLSEVPQGI PSNTRYLNLMENNIQMIQADTFRHLHHLEVLQLGRNS IRQI EVGAFNGLAS 

LNTLELFDNWLTVI PSGAFEYLSKLRELWLRNNP I E S I PS YAFNRVPSLMRLDLGELKKLEY 
ISEGAFEGLFNLKYLNLGMCNIKDMPNLTPLVGLEELEMSGNHFPEIRPGSFHGLSSLKKLW 
V^SQVSLIERNAFDGIASLVELNLAHNNLSSLPHDLFTPLRYLVEIJILHHNPWNCDCDILW 

LAWWLREY I PTNSTCCGRCHAPMHMRGR YL VE VDQASFQCSAP F I MDAPRDLN I SEGRMAEL 
KCRTPPMSSVKWLLPNGTVLSHASRHPRISVLNDGTLNFSHVLLSDTGVYTCMVTNVAGNSN 
ASAYLNVSTAELNTSNYSFFTTVTVETTEISPEDTTRKYKPVPTTSTGYQPAYTTSTTVLIQ 

TTRVPKQVAVPATDTTDKMQTSLDEVMKTTKI I IGCFVAVTLLAAAMLI VFYKLRKRHQQRS 
TVTAARTVEIIQVDEDIPAATSAAATAAPSGVSGEGAVVLPTIHDHINYNTYKPAHGAHWTE 

NSLGNSLHPTVTTISEPYI IQTHTKDKVQETQI 



WO 99/63088 



PCT/US99/12252 



FIGURE 158 

CGCTCGGGCACCAGCCGCGG(^GGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTT 
TTCTTCAGCTCCTTCTCATCTCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGC 
CCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTGAATATGATCAGATTGAGTGCGT 
CTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAATG 
AGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGA 
AATGGCTCATGGGGGGGTACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTG 
CCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTGGCCAGGTTCTGCGAGCCCCAAAGG 
GTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGGTAAA 
CCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCA 
GTATGACTATGTTGAGGTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCT 
GTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAGGATCCTCACTCCACGTCCTCTTCCAC 
TCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGCATGCTC 
CTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTG 
CCTGCTTGGCAGGCTATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCA 
GACCCTGGGGGCCCAGTCAATGGGTACCAGAAAATAACAGGGGGCCCTGGGCTTATCAACGG 
ACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGTTCTTAGTG 
GCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATA 
AAAGCCTGCCGAGAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGT 
TCAGTCAAGGGAGACACCATTACACCAGCTATACTCAGCGGCCTTCAGCAAGCAGAAACTGC 
AGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGGATACCAACAT 
CTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAG 
GAGGACATGTCTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCG 
GGAAAATTGAGAACATCACTGCTCCAAAGACCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCC 
ATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGCGTGGTTCCTAGT 
CTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACC 
TGGGGAAGGTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGG 
GATGATGACCGGGATGAGAAGACCATCCAGAGCCTACAGATTTCTGCTATCATTCTGCATCC 
CAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCTCCTAGACAAGGCCC 
GTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTC 
CAGGAGTCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTT 
CAAGAACGACACACTGCGCTCTGGGGTGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGC 
AGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTTCTGTGCCAGCTGGGAA 
CCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTGCTTCCC 
GGGACGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATA 
AAACATGCAGCCACAGGCTCTCCACTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATT 
GAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAGTGTTTCTGTATATCCGTC 
TGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGG 
CTGTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGC 
TGGTAGGCTGATGCCGCGTCCACTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAG 
AAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCACTCCACTGACCTGGTGGTCT 
TCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGC 
TGGGATGTGGTGCATGCCTTTGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCC 
ATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCTGAACTACAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 159 

l^LGCWTQLGLTFLQLLLISSLPREYW^ 

GYTIPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGD 

CMRCGQVLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQTO 

GDNRDGQIIKRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDG 

TC^DKAGSYKCACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTV 

VSFFCNNSYVLSGNEKRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLH 

QLYSAAFSKQKLQSAPTKKPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGK 

WSGRAPSCIPICGKIENITAPKTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNE 

RTVVVAAHCVTDLGKVTM I KTADLKWLGKFYRDDDRDEKT IQSLQI S AI ILHPNYDP I LLD 

AD I A I L KLLDKAR I STRVQP I CLAASRDLSTS FQESH I TVAGWNVLADVRS PGFKNDTLRSG 

VVSVVDSLLCEEQHEDHGIPVSVTDNMFCASWEPTAPSDICTAETGGIAAVSFPGRASPEPR 

WHLMGLVSWS YDKTCSHRLSTAFTKVLPFKDW I ERNMK 
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ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGA 

AGCTTTCTTGCCTGGAGTGAAGCAGAGAGATAGATATTATTCACGT^ 

TTCAACCTGACTTTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCT 

GACAGTGGTTGGGTGGGCCACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAG 

CAAAGGAGTTCATGGCTAATTTCCATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACT 

AATGAAGCATCCACGAAGAAGGTAGAACTTGACAACTGTCCTTCTGTGTCTCCTTACCTCAG 

AGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCACTT^ 

CCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAGCT 

GTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAACATCTGCATCCCTTCCT 
GCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGGTAAAAAGTTTA 
ATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGGACTGC 
TTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGA 
GCATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGAT 
ATTTTGGGGGTGTTACTGCCCTAAGCAGAGXGCAGTTTTTCAAGGTGAATGGATTCTCTAAC 
AACTACTGGGGATGGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAAT 
GAAAATTTCCCGGCCCCTGCCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACA 
AAGGCAATGAGGTGAACGCAGAACGGATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGA 
ACAGATGGGTTGAGTAGTTGTTCTTATAAATTAGTATCTGTGGAACACAATCCTTTATATAT 
CAACATCACAGTGGATTTCTGGTTTGGTGCATGACCCTGGATCTTTTGGTGATGTTTGGAAG 
AACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAAATAGTAGCACACATTAAGA 
ACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCTTAGCAGAGCTCCT 
GGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGATCATGAGG 
GTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAAA 
ATGAACGCTATTTGAGGAGTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATATTAT 
GGGATAAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGT 
CCAAGGTAGAAAGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTG 
AAGTGGTGGTGTCAGGTGAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGA 
CACAGTGAACTTGGGAATGAAGAGGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAG 
TAGCTGAGCTGGTTGCAGGTGCTGATAGCCTTCAGGGGAGGACCTGCCCAGGTATGCCTTCC 
AGTGATGCCCACCAGAGAATACATTCTCTATTAGTTTTTAAAGAGTTTTTGTAAAATGATTT 
TGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACATATTAACTAATAATAAATATGT 
CTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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MG FNLTFHLS YKFRLLLLLTLCLTWGWATSNYF VGA I QE I P KAKE FMANFHKTL I LG KG KT 
LTNEASTKKVELDNGPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRV 
AILVPHRNREKHLMYLLEHLHP 

WDCFIFHDVDLVPENDFNLYKCEEHPKHLVVGRNSTGYRLRYSGYFGGVTALSREQFFKVNG 

FSNNYWGWGGEDDDLRLRVELQRMKISRPLP 

VWRTDGLS S CS YKLVS VEHNPL Y INI TVD FWFG A 

Important features : 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites: 

amino acids 4-7, 220-223 and 335-338 



Xylose isomerase proteins: 

amino acids 191-201 
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FIGURE 162 
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CGTGGGCCGGGGTCGCGCAGCGGGCTGTGGGCGCGCCCGGAGGAGCGACCGCCGCAGTTCTC 
GAGCTCCAGCTGCATTCCCTCCGCGTCCGCCCCACGCTTCTCCCGCTCCGGGCCCCGC^ATG 
GCCCAGGCAGTGTGGTCGCGCCTCGGCCGCATCCTCTGGCTTGCCTGCCTCCTGCCCTGGGC 
CCCGGCAGGGGTGGCCGCAGGCCTGTATGAACTCAATCTCACCACCGATAGCCCTGCCACCA 
CGGGAGCGGTGGTGACCATCTCGGCCAGCCTGGTGGCCAAGGACAACGGCAGCCTGGCCCTG 
CCCGCTGACGCCCACCTCTACCGCTTCCACTGGATCCACACCCCGCTGGTGCTTACTGGCAA 
GATGGAGAAGGGTCTCAGCTCCACCATCCGTGTGGTCGGCCACGTGCCCGGGGAATTCCCGG 
TCTCTGTCTGGGTCACTGCCGCTGACTGCTGGATGTGCCAGCCTGTGGCCAGGGGCTTTGTG 
GTCCTCCCCATCACAGAGTTCCTCGTGGGGGACCTTGTTGTCACCCAGAACACTTCCCTACC 
CTGGCCCAGCTCCTATCTCACTAAGACCGTCCTGAAAGTCTCCTTCCTCCTCCACGACCCGA 
GCAACTTCCTCAAGACCGCCTTGTTTCTCTACAGCTGGGACTTCGGGGACGGGACCCAGATG 
GTGACTGAAGACTCCGTGGTCTATTATAACTATTCCATCATCGGGACCTTCACCGTGAAGCT 
CAAAGTGGTGGCGGAGTGGGAAGAGGTGGAGCCGGATGCCACGAGGGCTGTGAAGCAGAAGA 
CCGGGGACTTCTCCGCCTCGCTGAAGCTGCAGGAAACCCTTCGAGGCATCCAAGTGTTGGGG 
CCCACCCTAATTCAGACCTTCCAAAAGATGACCGTGACCTTGAACTTCCTGGGGAGCCCTCC 
TCTGACTGTGTGCTGGCGTCTCAAGCCTGAGTGCCTCCCGCTGGAGGAAGGGGAGTGCCACC 
CTGTGTCCGTGGCCAGCACAGCGTACAACCTGACCCACACCTTCAGGGACCCTGGGGACTAC 
TGCTTCAGCATCCGGGCCGAGAATATCATCAGCAAGACACATCAGTACCACAAGATCCAGGT 
GTGGCCCTCCAGAATCCAGCCGGCTGTCTTTGCTTTCCCATGTGCTACACTTATCACTGTGA 
TGTTGGCCTTCATCATGTACATGACCCTGCGGAATGCCACTCAGCAAAAGGACATGGTGGAG 
AACCCGGAGCCACCCTCTGGGGTCAGGTGCTGCTGCCAGATGTGCTGTGGGCCTTTCTTGCT 
GGAGACTCCATCTGAGTACCTGGAAATTGTTCGTGAGAACCACGGGCTGCTCCCGCCCCTCT 
ATAAGTCTGTCAAAACTTACACCGTGTGAGCACTCCCCCTCCCCACCCCATCTCAGTGTTAA 
CTGACTGCTGACTTGGAGTTTCCAGCAGGGTGGTGTGCACCACTGACCAGGAGGGGTTCATT 
TGCGTGGGGCTGTTGGCCTGGATCATCCATCCATCTGTACAGTTCAGCCACTGCCACAAGCC 
CCTCCCTCTCTGTCACCCCTGACCCCAGCCATTCACCCATCTGTACAGTCCAGCCACTGACA 
TAAGCCCCACTCGGTTACCACCCCCTTGACCCCCTACCTTTGAAGAGGCTTCGTGCAGGACT 
TTGATGCTTGGGGTGTTCCGTGTTGACTCCTAGGTGGGCCTGGCTGCCCACTGCCCATTCCT 
CTCATATTGGCACATCTGCTGTCCATTGGGGGTTCTCAGTTTCCTCCCCCAGACAGCCCTAC 
CTGTGCCAGAGAGCTAGAAAGAAGGTCATAAAGGGTTAAAAATCCATAACTAAAGGTTGTAC 
ACATAGATGGGCACACTCACAGAGAGAAGTGTGCATGTACACACACCACACACACACACACA 
CACACACACACAGAAATAT AAACACATGCGT CACATGGG CATTTCAG ATGATCAGCTCTGTA 
TCTGGTTAAGTCGGTTGCTGGGATGCACCCTGCACTAGAGCTGAAAGGAAATTTGACCTCCA 
AGCAGCCCTGACAGGTTCTGGGCCCGGGCCCTCCCTTTGTGCTTTGTCTCTGCAGTTCTTGC 
GCCCTTTATAAGGCCATCCTAGTCCCTGCTGGCTGGCAGGGGCCTGGATGGGGGGCAGGACT 
AATACTGAGTGATTGCAGAGTGCTTTATAAATATCACCTTATTTTATCGAAACCCATCTGTG 
AAACTTTCACTGAGGAAAAGGCCTTGCAGCGGTAGAAGAGGTTGAGTCAAGGCCGGGCGCGG 
TGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGATCAGGA 
GATCGAGACCACCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAAATACAAAAAGTT 
AGCCGGGCGTGGTGGTGGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATG 
GTGCGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCAGATGGCGCCACTGCACTCCAGCCTGA 
GTGACAGAGCGAGACTCTGTCTCCA 
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MAQAVWSRLGRILWLACLLPWAPAGVAAGLYELNLTTDSPATTGAWTISASLVAKDNGSLA 
LPADAHLYRFHWIHTPLVLTGKMEKGLSSTIRWGHVPGEFPVSVWVTAADCWMCQPVARGF 
WLPITEFLVGDLWTQNTSLPWPSSYLTKTVLKVSFLLHDPSNFLKTALFLYSWDFGDGTQ 
r^EDSVVYYNYSIIGTFTVKLKVVAEWEEVEPDATRAVKQKTGDFSASLKLQETLRGIQVL 

GPTL I QTFQKMTVTLNFLGS PPLTVCWRLKPE CL PLEEGE CH PVS VASTAYNLTHTFRDPGD 
YCFSIRAENIISKTHQYHKIQVWPSRIQPAVFAFPCATLITVMLAFIMYMTLRNATQQKDMV 

ENPEPPSGVRCCCQMCCGPFLLETPSEYLEIVRENHGLLPPLYKSVKTYTV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domain: 

amino acids 339-362 



N-glycosylation sites. 

amino acids 34-37, 58-61, 142-145,. 197-200, 300-303 and 364-367 
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GCTCAAGACCGAGGAGTGGGACAGCGAGACAGACGGC^ 

GGGCCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCA 
CAACAGACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTG 
GATGCCCATGTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCT 
GCGGCTGCTGTCATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTG 
CCCCCGTCCCCTCCCTTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAA 
TGGCTGGTTCTTTTGTTTTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 165 



MALSSQIWAACLLLLLLLASLTSGSV 
FPICIFCCGCCHRSKCGMCCKT 



WO 99/63088 
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CTGTCAGGAAGGACCATCTGAAGGCTGCAATTTGTTCTTAGGGAGGCAGGTGCTGGCCTGGC 
CTGGATCTTCCACCATGTTCCTGTTGCTGCCTTTTGATAGCCTGATTGTCAACCTTCTGGGC 
ATCTCCCTGACTGTCCTCTTCACCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGG 
AGTCTCCTTTGGTATCCGCAAACTCTACATGAAAAGTCTGTTA7LAAATCTTTGCGTGGGCTA 
CCTTGAGAATGGAGCGAGGAGCCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAAC 
GGAATCATTGCAAAGGATCCCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGG 
TAGTAGTAAGGCTCTGGACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCC 
GGAAAGGAATGGAGACCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTG 
GAGTCCTGGAACCTGCTGAGCAGAACCAATTATAACTTCCAGTACATCAGCCTTCGGCTCAC 
GGTCCTGTGGGGGTTAGGAGTGCTGATTCGGTACTGCTTTCTGCTGCCGCTCAGGATAGCAC 
TGGCTTTCACAGGGATTAGCCTTCTGGTGGTGGGCACAACTGTGGTGGGATACTTGCCAAAT 
GGGAGGTTTAAGGAATTCATGAGTAAACATGTTCACTTAATGTGTTACCGGATCTGCGTGCG 
AGCGCTGACAGCCATCATCACCTACCATGACAGGGAAAACAGACCAAGAAATGGTGGCATCT 
GTGTGGCCAATCATACCTCACCGATCGATGTGATCATCTTGGCCAGCGATGGCTATTATGCC 
ATGGTGGGTCAAGTGCACGGGGGACTCATGGGTGTGATTCAGAGAGCCATGGTGAAGGCCTG 
CCCACACGTCTGGTTTGAGCGCTCGGAAGTGAAGGATCGCCACCTGGTGGCTAAGAGACTGA 
CTGAACATGTGCAAGATAAAAGCAAGCTGCCTATCCTCATCTTCCCAGAAGGAACCTGCATC 
AATAATACATCGGTGATGATGTTCAAAAAGGGAAGTTTTGAAATTGGAGCCACAGTTTACCC 
TGTTGCTATCAAGTATGACCCTCAATTTGGCGATGCCTTCTGGAACAGCAGCAAATACGGGA 
TGGTGACGTACCTGCTGCGAATGATGACCAGCTGGGCCATTGTCTGCAGCGTGTGGTACCTG 
CCTCCCATGACTAGAGAGGCAGATGAAGATGCTGTCCAGTTTGCGAATAGGGTGAAATCTGC 
CATTGCCAGGCAGGGAGGACTTGTGGACCTGCTGTGGGATGGGGGCCTGAAGAGGGAGAAGG 
TGAAGGACACGTTCAAGGAGGAGCAGCAGAAGCTGTACAGCAAGATGATCGTGGGGAACCAC 
AAGGACAGGAGCCGCTCCTGAGCCTGCCTCCAGCTGGCTGGGGCCACCGTGCGGGGTGCCAA 
CGGGCTCAGAGCTGGAGTTGCCGCCGCCGCCCCCACTGCTGTGTCCTTTCCAGACTCCAGGG 
CTCCCCGGGCTGCTCTGGATCCCAGGACTCCGGCTTTCGCCGAGCCGCAGCGGGATCCCTGT 
GCACCCGGCGCAGCCTACCCTTGGTGGTCTAAACGGATGCTGCTGGGTGTTGCGACCCAGGA 
CGAGATGCCTTGTTTCTTTTACAATAAGTCGTTGGAGGAATGCCATTAAAGTGAACTCCCCA 
CCTTTGCACGCTGTGCGGGCTGAGTGGTTGGGGAGATGTGGCCATGGTCTTGTGCTAGAGAT 
GGCGGTACAAGAGTCTGTTATGCAAGCCCGTGTGCCAGGGATGTGCTGGGGGCGGCCACCCG 
CTCTCCAGGAAAGGCACAGCTGAGGCACTGTGGCTGGCTTCGGCCTCAACATCGCCCCCAGC 
CTTGGAGCTCTGCAGACATGATAGGAAGGAAACTGTCATCTGCAGGGGCTTTCAGCAAAATG 
AAGGGTTAGATTTTTATGCTGCTGCTGATGGGGTTACTAAAGGGAGGGGAAGAGGCCAGGTG 
GGCCGCTGACTGGGCCATGGGGAGAACGTGTGTTCGTACTCCAGGCTAACCCTGAACTCCCC 
ATGTGATGCGCGCTTTGTTGAATGTGTGTCTCGGTTTCCCCATCTGTAATATGAGTCGGGGG 
GAATGGTGGTGATTCCTACCTCACAGGGCTGTTGTGGGGATTAAAGTGCTGCGGGTGAGTGA 
AGGACACATCACGTTCAGTGTTTCAAGTACAGGCCCACAAAACGGGGCACGGCAGGCCTGAG 
CTCAGAGCTGCTGCACTGGGCTTTGGATTTGTTCTTGTC 
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FIGURE 167 

MFLLLPFDS L I VNLLG I SLTVLFTLLLVF 1 1 VPA I FGVS FG I RKLYMKSLLK I F AWATLRME 
RGAKEKNHQLYKPYTNGIIAKDPTSLEEEIKEIRRSGSSKALDNTPEFELSDIFYFCRKGME 
TIMDDEVTKRFSAEELESVWLLSRTNYNFQYISLRLTVLWGLGVLIRYCFLLPLRIAIJ\FTG 
ISLLWGTTWGYLPNGRFKEFMSKHVHLMCYRI CVRALTAI ITYHDRENRPRNGGICVANH 
TS P I D V 1 1 LASDG Y Y AMVGQ VHGGLMG V I QRAMVKAC PHVWF ERS E VKDRHLVAKRLTEHVQ 
DKSKLPILIFPEGTCINNTSVMMFKKGSFEIG 

LRI^TSWAIVCSVWYLPPMTREADEDAVQFANRVKSAIARQGGLVDLLWDGGLKREKVKDTF 
KEEQQKLYSKMIVGNHKDRSRS 
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FIGURE 168 

GCCCCTCGAAACC^GGACTCCAGC^CCTCTGGTCCCGCCCTCACCCGGACCCCTGGCCCTC^ 

CGTCTCCTCCAGGGATGGCGCTGGCGGCTTTGATGATCGCCCTCGGCAGCCTCGGCCTCCAC 

ACCTGGCAGGCCCAGGCTGTTCCCACCATCCTGCCCCTGGGCCTGGCT^ 

CGATACCTATGTGGGTTGTGCAGAGGAGATGGAGGAGAAGGCAGCCCCCCTGCTAAAGGAGG 

AAATGGCCCACCATGCCCTGCTGCGGGAATCCTGGGAGGCAGCCCAGGAGACCTGGGAGGAC 

AAGCGTCGAGGGCTTACCTTGCCCCCTGGCTTCAAAGCCCAGAATGGAATAGCCATTATGGT 

CTACACCAACTCATCGAACACCTTGTACTGGGAGTTGAATCAGGCCGTGCGGACGGGCGGAG 

GCTCCCGGGAGCTCTACATGAGGCACTTTCCCTTCAAGGCCCTGCATTTCTACCTGATCCGG 

GCCCTGCAGCTGCTGCGAGGCAGTGGGGGCTGCAGCAGGGGACCTGGGGAGGTGGTGTTCCG 

AGGTGTGGGCAGCCTTCGCTTTGAACCCAAGAGGCTGGGGGACTCTGTCCGCTTGGGCCAGT 

TTGCCTCCAGCTCCCTGGATAAGGCAGTGGCCCACAGATTTGGGGAGAAGAGGCGGGGCTGT 

GTGTCTGCGCCAGGGGTGCAGCTAGGGTCACAATCTGAGGGGGCCTCCTCTCTGCCCCCCTG 

GAAGACTCTGCTCTTGGCCCCTGGAGAGTTCCAGCTCTCAGGGGTTGGGCCCTGAAAGTCCA 

ACATCTGCCACTTAGGAGCCCTGGGAACGGGTGACCTTCATATGACGAAGAGGCACCTCCAG 

CAGCCTTGAGAAGCAAGAACATGGTTCCGGACCCAGCCCTAGCAGCCTTCTCCCCAACCAGG 

ATGTTGGCCTGGGGAGGCCACAGCAGGGCTGAGGGAACTCTGCTATGTGATGGGGACTTCCT 

GGGACAAGCAAGGAAAGTACTGAGGCAGCCACTTGATTGAACGGTGTTGCAATGTGGAGACA 

TGGAGTTTTATTGAGGTAGCTACGTGATTAAATGGTATTGCAGTGTGGA 



WO 99/63088 - / ^ PCT/US99/122S2 

FIGURE 169 

MAIjAALMIALGSLGLHTWQAQAVPTILPLGLAPDTFDDTYVGCAEEMEEKAAPLLKEEMAHH 
ALLRESWEAAQETWEDKRRGLTLPPGFKAQNGIAIMVYTNSSNTLYWELNQAVRTGGGSREL 
YMRHFPFKALHFYLIRALQLLRGSGGCSRGPGEWFRGVGSLRFEPKRLGDSVRLGQFASSS 
LDKAVAHRFGEKRRGCVSAPGVQLGSQSEGASSLPPWKTLLLAPGEFQLSGVGP 



WO 99/63088 . PCT/US99/12252 

FIGURE 170 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATA^GCTGGTTCCCO^ACATGCCTCA 
CCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTG 
GTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTC 
TATTGTCTGGACCTTCAACAC^CCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCA 
TAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAG 
CTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACT 
CCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAG 
TCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATG 
GAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTC 
CCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAGAAAGTGATATGACCTTCATCT 
GCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCCATCCTTGCCAGGAAGCTCTGT 
GAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCTGTGTCTCCTGTTGGTGCCCCT 
CCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTCTGAAGAGAGAGAGACAAGAAG 
AGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAAACTCCTAACATATGCCGCCAT 
TCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAATAGAACAATCCTAAAGGAAGA 
TCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGATGGAAAATCCCCACTCAC 
TGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTATCTAGACAGCAGTG 
CACTCCCCTAAGTCTCTGCTCA 
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FIGURE 171 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAWFPLKSKVKQVDSIWTFNTTPLVT 
IQPEGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYWGIYSSSLQQPSTQEYVLHV 
YEHLSKPKVTMGLQSNKNGTCVTNLTCCM 

GESDMTF I CVARNP VSRNFS S P I LARKLCEGAADDPDS SMVLLCLLLVPLLLSLFVLGLFLW 
FLKRERQEEY IEEKKRVD I CRETPN I CPHSGENTE YDT I PHTNRT I LKEDPANTVYS TVE I P 
KKMENPHSLLTMPDTPRLFAYENVI 
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FIGURE 172 

CTGGTTCCCCAACATGCCTCACCCT 

TCTGGACCCGTGAAAGAGCTGGTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTC 
CAAAGTAAAGCAAGTTGACTCTATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATAC 
AGCCAGAAGGGGGCACTATCATAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCA 
GATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGT 
GGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACG 
AGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTG 
ACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCT 
GGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAG 
AAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCC 
ATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCT 
GTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTC 
TGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAA 
ACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAA 
TAGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAA 
AGATGGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAG 
AATGTTATCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCAAAAAAAAAAAAAAAAAAA 
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FIGURE 173 

GAAAGACGTGGTCCTGAC^GAC^GAC^TCCTATTCCCTACCAAAATCAAGATGCTGCTGCT 
GCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGAAGCTAGTTCTACGGGAAGGA 
ACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATCCTGGCCTCTGACAAAAGA 
GAAAAGATAGAAGAACATGGCAACTTTAGACTTTTT CTGG AG CAAATCCATGTCTTGGAGAA 
TTCCTTAGTTCTTAAAGTCCATACTGTAAGAGATGAAGAGTGCTCCGAATTATCTATGGTTG 
CTGACAAAACAGAAAAGGCTGGTGAATATTCTGTGACGTATGATGGATTCAATACATTTACT 
ATACCTAAGACAGACTATGATAACTTTCTTATGGCTCACCTCATTAACGAAAAGGATGGGGA 
AACCTTCCAGCTGATGGGGCTCTATGGCCGAGAACCAGATTTGAGTTCAGACATCAAGGAAA 
GGTTTGCACAACTATGTGAGGAGCATGGAATCCTTAGAGAAAATATCATTGACCTATCCAAT 
GCCAATCGCTGCCTCCAGGCCCGAGAATGAAGAATGGCCTGAGCCTCCAGTGTTGAGTGGAG 
ACTTCTCACCAGGACTCCACCATCATCCCTTCCTATCCATACAGCATCCCCAGTATAAATTC 
TGTGATCTGCATTCCATCCTGTCTCACTGAGAAGTCCAATTCCAGTCTATCAACATGTTACC 
TAGGATACCTCATCAAGAATCAAAGACTTCTTTAAATTTCTCTTTGATACACCCTTGACAAT 
TTTTCATGAAATTATTCCTCTTCCTGTTCAATAAATGATTACCCTTGCACTTAA 
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FIGURE 174 

MKMLLLLCLGLTLVCVHAEEASSTGRNFNVEKINGEWHTIILASDKREKIEEHGNFRLFLEQ 
IHVLENSLVLKVHTVRDEECSELSJWADKTEKAGEYSVTYDGFNTFTIPKTDYDNFLMAHLI 
NEKDGETFQLMGLYGRE PDLS SD I KERFAQLCEEHG I LREN 1 1 DLSNANRCLQARE 
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FIGURE 175 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAA 
TGGATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTG 
TCAGCTTAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTC 
CCAGGAATTATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAG 
AAAAAGAGCGTGCTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCA 
CAGTCATTGGTGCTCTGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTC 
ATGTGTAATTCTCCAAGCAACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGA 
CATTCATCCAGAATCCTTCAACTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTG 
GTTTCAATAAACCCACCAGTAACGACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCAC 
TTCGATTCTGAAGAAAACAAACATAGGCTTATCGACTTCTCAGTATTTTTAGGTCTATTGCT 
TGTTGGAATTCTGGAGGTCCTGTTTGGGCTCAGTCAGATAGTCATCGGTTTCCTTGGCTGTC 
TGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTGTGTAGTTTAATGGGAATAAAATGTAAGTA 
TCAGTAGTTTGAAAAAAAAAAA 
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FIGURE 176 



MTCCEGWTSCNGFSLLVLLLLGVVLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMA 
I PATTMSLTARKRACCNNRTGMFLSS FFSVITVI GALYCMLI S I QALLKGPLMCNS PSNSNA 
NCEFSLKNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRL 
I HFS VFLGLLLVG I LEVLFGLSQ I VI GFLGCLCGVS KRRSQ IV 
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FIGURE 177 

GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATGAGGCT 
GTCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT 
GCCCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTC 
CAAGTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTG 
CACCGATCAGATATCTTTTAAGAAACGACTCTC^TTGAAAAAGTCCTGGTGGAAATAGTGAA 
AAAATGTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTCCAAAGTCTTTCAACGACACC 
CTGATCTTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAATAAATCACTTGCCCTGC 
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FIGURE 178 

MRLSVCU^SLALCCTQAHALVCPAVASE 
KHCTDQISFKKRLSLKKSWWK 
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FIGURE 179 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGG 
TGAAGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTG 
GAGCAGATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTC 
GAACTGTGACATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGG 
AAGCCAATGACCCATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAG 
CTGAGCGGACTGATCTGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGG 
CAAATGCAAATACAAGAGCAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCAC 
TCATCACTCCAGGCTCTGCCACTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGA 
AGCCTAACACTGGCCCCCAGCACCTCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTC 
TCTCCAAGGGCAGGCTGTTAGGCCCCTTTCTGATCAGGAGGCTTCTTTATGAATTAAACTCG 
CCCCACCACCCCCTCA 



WO 99/63088 , PCTAJS99A2252 

FIGURE 180 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCK 
YKSSQKQHS PVPEKAI PL I TPGS ATTC 
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FIGURE 181 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCC 
TGGCTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTC 
CTGGCTACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCC 
AGTGTTTCCCACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCT 
ACAGAGGAGGGCTTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGT 
ATGGCTGGGTCCCATCATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCA 
CCAATGCCTCAGCTGCCATTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGG 
CTGGGAGAAGGGATACTGCTGAGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGAC 
GCCCGCCTTCCATTTCAACATCCTGAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACA 
TCATGCTTGACAAGTGGCAGCACCTGGCCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAG 
CACATCAGCCTCATGACCTTGGACAGTCTACAGAAATGCATCTTCAGCTTTGACAGCCATTG 
TCAGGAGAGGCCCAGTGAATATATTGCCACCATCTTGGAGCTCAGTGCCCTTGTAGAGAAAA 
GAAGCCAGCATATCCTCCAGCACATGGACTTTCTGTATTACCTCTCCCATGACGGGCGGCGC 
TTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGACGCTGTCATCCGGGAGCGGCGTCG 
CACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAAAGCCAAGTCCAAGACTTTGG 
ATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGGCATTGTCAGATGAGGAT 
ATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACGGCCAGTGGCCTCTC 
CTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCGACAGGAGGTGC 
AAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCCAGCTGCCC 
TTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATCTCCCG 
ATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTGCC 
TCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGAC 
CCCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTC 
CGCAGGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGG 
CGTTGATGCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAA 
TTGATCATGCGCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCA 
GTGACTTTCTGACCCATCCACCTG TTTT TTTG CAG ATTGT CATG AATAAAACGGTG CTGT CAAA 



WO 99/63088 / PCT/US99/12252 

FIGURE 182 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWLI^ 

HLGLITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLF 

IRFLKPWLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGS 

SRLDMFEHISLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYY 

LSHDGRRFHRACRLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDG 

KALSDEDIRAEADTFMFGGHDTTASGLSWWiYNLARHPEYQERCRQEVQELLKDRDPKEIEW 

DDLAQLPFLTMCVKESLRLHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVW 

PDPEWDPFRFDPENSKGRSPLAFIPFSAGPRNCIGQAFAMAEM 

E PRRKLEL I MRAEGGLWLR VE PLNVGLQ 
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FIGURE 183 

C^CAGAAGCC^GAAGGAAGCCGTCTATCTTGTGGCGATCATCTATAAGCTGGCCtCCTGC 
TGTTTGCTTTTCACAGGATTCTTAAATCCTCTCTTATCTCTTCCTCTCCTTGACTCCAGGGA 
AATATCCTTTCAACTCTCAGCACCTCATGAAGACGCGCGCTT7VACTCCGGAGGAGCTAGAAA 
GAGCTTCCCTTCTAC^GATATTGCCAGAGATGCTGGGTGCAGAAAGAGGGGATATTCTCAGG 
AAAGCAGACTCAAGTACCAACATTTTTAACCCAAGAGGAAATTTGAGAAAGTTTCAGGATTT 
CTCTGGACAAGATCCTAACATTTTACTGAGTCATCTTTTGGCCAGAATCTGGAAACCATACA 
AGAAACGTGAGACTCCTGATTGCTTCTGGAAATACTGTGTCTGAAGTGAAATAAGCATCTGT 
TAGTCAGCTCAGAAACACCCATCTTAGAATATGAAAAATAACACAATGCTTGATTTGAAAAC 
AGTGTGGAGAAAAACTAGGCAAACTACACCCTGTTCATTGTTACCTGGAAAATAAATCCTCT 
ATGTTTTGCACAAAAAAAAAAAAAAA 
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FIGURE 184 



^KIASCCLLFTGFLNPLLSLPLLDSREISFQLSAPHEDARLTPEELERASLLQILPEMLGA 
ERGDILRKADSSTNIFNPRGNLRKFQDFSGQDPNILLSHLLARIWKPYKKRETPDCFWKYCV 
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FIGURE 185 

GAACATTTTTAGTTCCCAAGGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGGAT 
GGGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCA 
CCACCTCCGCCAGGAACTGCAGGCCCACCTGTCTGCAACCCAGCTGAGGCCATGCCCTCCCC 
AGGGACCGTCTGCAGCCTCCTGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCT 
CCAGCTTCCTGAGCCCTGAACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACCA 
GCCAAGCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCGCCCGGAAGATGGAGGTCAAGCAGA 
AGGGGCAGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGT 
CAGGGGTTCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCTCTGG 
GAAGAGGCCAAAGAGGCCCCAGCCGACAAGTGATCGCCCACAAGCCTTACTCACCTCTCTCT 
AAGTTTAGAAGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTA 
CAAGCTCAGGAGGCGAATAAATGTTCAAACTGTA 
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FIGURE 186 

MPSPGTVCSLLLLGMLWLDLAMA^ 

GQAEGAEDELEVRFNAPFDVG I KLSGVQYQQHSQALGKFLQD I LWEEAKEAPADKO 
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FIGURE 187 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTC 
GTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGT 
TCCTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATGCTCGTG 
CTCCTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTCCACATCTACCT 
GAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATC 
TTACACCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTT 
GTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAA 
GGGCAAATCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAAC 
TCACGCCGAGACCTGCAGGAGTGGTGCCAGGTGCTTGAAGTAACAAGTTTAAAATGTTCAGA 
GACAATGGAATGGAATCTATTAGGCAAGAACAGGACATTATGAAATAAGGACAGGTGGACTT 
CCAAAAACACAAGTAGAAATTCTAACAATGAAATATATTACAGGCAGGTCACCCACTAACCA 
AACAACTGAAGCGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTC 
AGTCATGTTGCTGAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGT 
AACAACAACCTCCCTGCTCCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCG 
TCTTCCATTCTCTGGGCAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGAC 
ACGGGCAGCAGAGTGTGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCA 
ACCCCTGGTCAGGGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGA 
GCATCCCCTGCCTGCAGTTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGA 
GCCTCCTTGTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGAT 
GTTTTAAAACACACACCTCTAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGA 
ATTTTTTTTAATGAAAGTGCAATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 188 

MNTWLLFLPLFPVQVQTLIWIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQGW 
WRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSMRTQQ 

AQQEAELTPRPAGWPGA 



WO 99/63088 / PCT/US99/12252 

FIGURE 189 

GGAGTGCAGATGGCATCCTTCGGTTCTTCCAGACAAGCTGCAAGACGCTGACC ATGG CCAAG 
ATGGAGCTCTCGAAGGCCTTCTCTGGCCAGCGGACACTCCTATCTGCCATCCTCAGCATGCT 
ATCACTCAGCTTCTCCACAACATCCCTGCTCAGCAACTACTGGTTTGTGGGCACACAGAAGG 
TGCCCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTG 
GATGGAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGA 
CCGGTTCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAG 
AACCAGGGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAA 
GGACTACTGGAATTTGCCACGTTGCAAGGCCCATGTCACCCCACTCTCCGATTTGGAGGGAA 
GCGGTTGATGGAGAAGGCTTCCCTCCCCTCCCCTCCCTTGGGGCTTTGTGGCAAAAATCCTA 
TGGTTATCCCTGGGAACGCAGATCACCTACATCGGACTTCAATTCATCAGCTTCCTCCTGCT 
ACTAACAGACTTGCTACTCACTGGGAACCCTGCCTGTGGGCTCAAACTGAGCGCGTTTGCTG 
CTGTTTCCTCTGTCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTC 
TTCCAAGCGACTGTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTG 
GGCCTTCTACATGGCCTGGCTCTCCTTCACCTGCTGCATGGCGTCGGCTGTCACCACCTTCA 
ACACGTACACCAGGATGGTGCTGGAGTTCAAGTG CAAG C ATAGTAAGAGCTTCAAGG AAAAC 
CCGAACTGCCTACCACATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCAC 
CGTGGGTCCTTTGACCAGCTACCACCAGTATCATAATCAGCCCATCCACTCTGTCTCTGAGG 
GAGTCGACTTCTACTCCGAGCTGCGGAACAAGGGATTTCAAAGAGGGGCCAGCCAGGAGCTG 
AAAGAAGCAGTTAGGTCATCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGA 
GTAGGCTTGAGCCCTACCTTACACGTCTGCTGATTATCAACATGTGCTTAAGCCAACATCCG 
TCTCTTGAGCATGGTTTTTAGAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTC 
CTAAGGGATTCCTGGGTGCCACTGCTCTCTTTTCCTCTACAGCTCCATCTTGTTTCACCCAC 
CCCACATCTCACACATCCAGAATTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGC 
TAAACCATGGAGATAAAAAGAAGAGTAAAATACACTTCCCGACCTTAAGGATCTGAAA 
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FIGURE 190 

MAKMELSKAFSGQRTLLSAILSMLSLSFSTTSLLSNYWFVGTQKVPKPLCEKGLAAKCFDMP 
VSLDGDTNTSTQEWQYNWETGDDRFSFRSFRSGMWLSCEETVEEPGERCRSFIELTPPAKR 
GEKGLLEFATLQGPCHPTLRFGGKRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQL 
PPATNRLATHWE PCLWAQTERLCCCFLCPVRS PGDGGPHDVFTS LPSD CQLGSRRLETTCLE 
LWLGLLHGLALLHLLHGVGCHHLQHVHQDGAGVQVQA 
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FIGURE 191 

AACTGGAAGGAAAGAAAGA^GGTCAGCTTTGGCCCAGATGTGGTTACCCCTTGGTCTCCTG 
TCTTTATGTCTTTCTCCTCTTCCTATTCTGTCATCTCCCTCACTTAAGTCTCAGGCCTGTCA 
GCAGCTCCTGTGGACATTGCCATCCCCTCTGGTAGCCTTCAGAGCAAACAGGACAACCTATG 
TTATGGATGTTTCCACCAACCAGGGTAGTGGCATGGAGCACCGTAACCATCTGTGCTTCTGT 
GATCTCTATGACAGAGCCACTTCTCCACCTCTGAAATGTTCCCTGCTCTGAAATCTGGCATG 
AGATGGCACAGGTGACCACGCAGAAGCCACCAGAATCTTGCCTGCCCTATTCCTCCTCCCAA 
GTCTGTTCTCTTATTGTCAACCTCAGCACAACAGGCTGGCGCCAATGGCATTACAGAGAAAG 
CAAT CTGTGTGG CT AGTGGGCAG ATT ACC ATG CAAG CC C CAGG AGAAATGGAGG AGCTTTGT 
AGCCACCTCCCTGTCAGCCAGTATTAACATGTCCCCTTCCCCCTGCCCCGCCGTAGATTCAG 
GACATTCGCCCCTGTGTGCCACCT^AACCAGGACTTTCCCCTTGGCTTGGCATCCCTGGCTCT 
CTCCTGGTACCCAGCAAGACGTCTGTTCCAGGGCAGTGTAGCATCTTTCAAGCTCCGTTACT 
ATGGCGATGGCCATGATGTTACAATCCCACTTGCCTGAATAATCAAGTGGGAAGGGGAAGCA 
GAGGGAAATGGGGCCATGTGAATGCAGCTGCTCTGTTCTCCCTACCCTGAGGAAAAACCAAA 
GGGAAGCAACAGGAACTTCTGCAACTGGTTTTTATCGGAAAGATCATCCTGCCTGCAGATGC 
TGTTGAAGGGGCACAAGAAATGTAGCTGGAGAAGATTGATGAAAGTGCAGGTGTGTAAGGAA 
ATAGAACAGTCTGCTGGGAGTCAGACCTGGAATTCTGATTCCAAACTCTTTATTACTTTGGG 
AAGTCACTCAGCCTCCCCGTAGCCATCTCCAGGGTGACGGAACCCAGTGTATTACCTGCTGG 
AACCAAGGAAACTAACAATGTAGGTTACTAGTGAATACCCCAATGGTTTCTCCAATTATGCC 
CATGCCACCAAAACAATAAAACAAAATTCTCTAACACTGAAA 
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FIGURE 192 



IWLPLGLLSLCLSPLPILSSPSLKSQACQQLLWTLPSPLVAFRANRTTYVMDVSTNQGSGME 
HRNHLCFCDLYDRATSPPLKCSLL 
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FIGURE 193 

GTAGCGCGTCTTGGGTCTCCCGGCTGCCGCTGCTGCCGCCGCCGCCTCGGGTCGTGGAGCCA 

GGAGCGACGTCACCGCCATGGCAGGCATCAAAGCTTTGATTAGTTTGTCCTTTGGAGGAGCA 

ATCGGACTGATGTTTTTGATGCTTGGATGTGCCCTTCCAATATACAACAAATACTGGCCCCT 

CTTTGTTCTATTTTTTTACATCCTTTCACCTATTCCATACTGCATAGCAAGAAGATTAGTGG 

ATGATACAGATGCTATGAGTAACGCTTGTAAGGAACTTGCCATCTTTCTTACAACGGGCATT 

GTCGTGTCAGCTTTTGGACTCCCTATTGTATTTGCCAGAGCACATCTGATTGAGTGGGGAGC 

TTGTGCACTTGTTCTGACAGGAAACACAGTCATCTTTGCAACTATACTAGGCTTTTTCTTGG 

TCTTTGGAAGCAATGACGACTTCAGCTGGCAGCAGTGGTGAAAAGAAATTACTGAACTATTG 

TCAAATGGACTTCCTGTCATTTGTTGGCCATTCACGCACACAGGAGATGGGGCAGTTAATGC 

TGAATGGTATAGCAAGCCTCTTGGGGGTATTTTAGGTGCTCCCTTCTCACTTTTATTGTAAG 

CATACTATTTTCACAGAGACTTGCTGAAGGATTAAAAGGATTTTCTCTTTTGGAAAAGCTTG 

ACTGATTTCACACTTATCTATAGTATGCTTTTTGTGGTGTCCTGCTGAATTTAAATATTTAT 

GTGTTTTTCCTGTTAGGTTGATTTTTTTTGGAATCAATATGCAATGTTAAACACTTTTTTAA 

TGTAATCATTTGCATTGGTTAGGAATTCAGAATTCCGCCGGCTCTATTACTGGTCAAGTACA 

TCTTTTCTCTTAAAATTATTTAGCCTCCATTATTACAAAAAATTATAAAAATAAGTTTTCAG 

TCAGTCAGGATGACATCACTCCCAATGTTATGCAGACATACAGACGGTTGGCATACGTTATA 

GACTGTATACTCAGTGCAAATATAGCTGCATTTATACCTCAGAGGGGCCAAGTGTTAATGCC 

CATGCCCTCCGTTAAGGGTTGTTGGTTTTACTGGTAGACAGATGTTTTGTGGATTGAAAATT 

ATTTTATGGAATTGCTACAGAGGAGTGCTTTTCTTCTCAATTGTTAGAAGAATTTATGTTAA 

ACTTTAAGGTAAGGGTGTAAAAACATTTTTGAGATAAGGTTTTTATTTATGTTTATTATTGT 

TAGAGTGAGTTGCAATGTGGGAAGAAATGACATTGAAATTCCAGTTTTTGAATCCTGTTTCT 

ATTTATAAGTGAAATTTGTGATCTCCTATCAACCTTTCATGTTTTACCCTGTTAAAATGGAC 

ATACATGGAACCACTACTGATGAGGGACAGTTGTATGTTTGCATCATATATGCCAGAAAACC 

TTCCTCTGCTTCCTCCTTTTGACTTATTTGGTATGTTGTATATATTACATAAAATAACTTTT 

CAAATATAGTTTAATAACACTTAGAAGTGTTTACTTACCTGGAAAATAATTGCTATGCCGTA 

CATTCAGAGTGCCCCCTCCCCTGCAAGGCCTTGCCATGATTAACAAGTAACTTGTTAGTCTT 

ACAGATAATTCATGCATTAACAGTTTAAGATTTAGACCATGGTAATAGTAGTTCTTATTCTC 

TAAGGTTATATCATATGTAATTTAAAAGTATTTTTAAGACAAGTTTCCTGTATACCTCTGAA 

CTGTTTTGATTTTGAGTTCATCATGATAGATCTGCTGTTTCCTTATAAAAGGCATTTGTTGT 

GTGAGTTAATGCAAAGTAGCCAAGTCCAGCTATATAGCAGCTTCAGAAACATACCTGACCAA 

AAAATTCCCAGTAACCAGGCATGATCAATTTATAGTGGTCGTTTACATCTAATAATTATCAG 

GACTTTTTTCAGGAGTGGGTTATAAAAACATTCAAGTTGGTCTGACAGTATTTTGTTAAGGA 

TATTTGTTTGTATGTTTATTCAGTATACTTACATAAAAATTATTTCGCCATCAGCCAAAACT 

CAGTAATCATGACAGCTGTCTGTTGTTTTATGAAGTTTATTTCTCAAGAAAATGGGAATAAA 

TTTGGGATTTGTTCAGCTTTTTTACTAAAGATGCCTAAAGCCACAGGTTTTATTGCCTAACT 

TAAGCCATGACTTTTAGATATGAGATGACGGGAAGCAGGACGAAATATCGGCGTGTGGCTGG 

AGCCTTCCCACTGGAGGCTGAAAGTGGCTTGTGGTATTATAATGTTCAGATTTCAAGAGGAA 

GGTGCAGGTACACATGAGTTAGAGAGCTGGTGAGACAGTTGGGAACTCTTTGTGCTTGTGAT 

CTACTGGACTTTTTTTTTGCAGGAAGTGCATTCTCTGGTCCTTCCCTATTTTCTGTTCTGGA 

TGTCAGTGCAGTGCACTGCTACTGTTTTATCCACTTGGCCACAGACTTTTTCTAACAGCTGC 

GTATTATTTCTATATACTAATTGCATTGGCAGCATTGTGTCTTTGACCTTGTATACTAGCTT 

GACATAGTGCTGTCTCTGATTTCTAGGCTAGTTACTTGAGATATGAATTTTCCATAGAATAT 

GCACTGATACAACATTACCATTCTTCTATGGAAAGAAAACTTTTGATGATGAAACAATAAAG 

ATTTTAAATATCTATTTTAAAAAAAAAA 
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FIGURE 194 

MAGI KALI SLSFGGAIGLMFIMLGCALPIY^ 

SNACKELAI FLTTG I WS AFGL P I VF ARAHL I EWGACALVLTGNTV I FAT I LGF FLVFGSND 
DFSWQQW 
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CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCC 

CACGCGTCCGCCCACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGA 

GCCCGGGCGCCTCTCGCCGCTCCCCGCGCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCG 

CCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCTGCAGCTGCCCGGGGCGCGAATGG 

CAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTCCAATGACGGACATT 

AACCAGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTTTTCCCCCCACAA 

CGTCACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAAGCTCGGGCTCCGGC 

ACGTAGTTGGGAAACTTGCGGGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTGCA 

GCCCCGAGCCGAGCAGCAAAGTGAGACATTGTGCGCCTGCCAGATCCGCCGGCCGCGGACCG 

GGGCTGCCTCGGAAACACAGAGGGGTCTTCTCTCGCCCTGCATATAATTAGCCTGCACACAA 

AGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGGATTTCTGACCGAGCGCTTCCAA 

TGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTAATGGATTTCCTGCTGCTCGGTCTCT 

GTCTATACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTT 

CAGATGCTGCCCGCCGCCCCCAGCGGGTGCCCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCT 

GTACTGCGAGGCGCTCAACCTCACCGAGGCGCCCCACAACCTGTCCGGCCTGCTGGGCTTGT 

CCCTGCGCTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTC 

ACGTGGCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACT 

GCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCT 

TCCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGCG 

CCCGACCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCA 

GTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTCAAGTTTCTCGACATCGGATACA 

ATCAGCTCAAGAGTCTGGCGCGCAACTCTTTCGCCGGCTTGTTTAAGCTCACCGAGCTGCAC 

CTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCCTGCA 

CTCGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGGTCAGCTCGCTGGACTGGGTTTGGA 

ACCTGGAGAAAATGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAG 

ACCGTGCCGCACCTGCAGTCCCTGCAGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCG 

GATCCTCAACTCTTGGAAGTCCCTGACAAGCATCACCCTGGCCGGGAACCTGTGGGATTGCG 

GGCGCAACGTGTGTGCCCTAGCCTCGTGGCTCAGCAACTTCCAGGGGCGCTACGATGGCAAC 

TTGCAGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTT 

CCACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACC 

GCAGTGATCTGGGGCCCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAG 

CACGACGGGACATTCGAGCCTGCCACCGTGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGC 

CGTGCAGATCCACAAGGTGGTCACGGGCACCATGGCCCTCATCTTCTCCTTCCTCATCGTGG 

TCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCAGACAGTGC 

TTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATGCATCAGATGGCTGCCATGTC 

TGCCCAGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCCTGGTGATCATCA 

ACGAGTATGGCTCGTGTACCTGCCACCAGCAGCCCGCGAGGGAATGCGAGGTGTGATTGTCC 

CAGTGGCTCTCAACCCATGCGCTACCAAATACGCCTGGGCAGCCGGGACGGGCCGGCGGGCA 

CCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGACTGAAACTTTAAGGGGATC 

TCTCCCAGAGACTTGACATTTTAGCTTTATTGTGTCTTAAAAACAAAAGCGAATTAAAACAC 

AACAAAAAACCCCACCCCACAACCTTCAGGACAGTCTATCTTAAATTTCATATGAGAACTCC 

TTCCTCCCTTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAAAAAGGTGGCCATAA 

GACAGAGAGAGAATAATCGTGCTTTGTTTTATGCTACTCCTCCCACCCTGCCCATGATTAAA 

CATCATGTATGTAGAAGATCTTAAGTCCATACGCATTTCATGAAGAACCATTGGAAAGAGGA 

ATCTGCAATCTGGGAGCTTAAGAGCAAATGATGACCATAGAAAGCTATGTTCTTACTTTGTG 

TGTGTGTCTGTATGTTTCTGCGTTGTGTGTCTTTGTAGGCAAGCAAACGTTGTCTACACAAA 

CGGGAATTTAGCTCACATCATTTCATGCCCCTGTGCCTCTAGCTCTGGAGATTGGTGGGGGG 

AGGTGGGGGGAAACGGCAGGAATAAGGGAAAGTGGTAGTTTTAACTAAGGTTTTGTAACACT 

TGAAATCTTTTCTTTCTCAAATTAATTATCTTTAAGCTTCAAGAAACTTGCTCTGACCCCTC 

TAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATTTTTAAAGGTGTAGCACCTTTTTTTT 

TATTCTTCCCACAGAGGGTGCTAATCTCATTATGCTGTGCTATCTGAAAAGAACTTAAGGCC 

ACAATTCACGTCTCGTCCTGGGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCA 

GCTGATTAAAGTTCAGCAGTGGTATTGAGGTTTTTCGAATATTTATATAGAAAAAAAGTCTT 

TTCACATGACAAATGACACTCTCACACCAGTCTTAGCCCTAGTAGTTTTTTAGGTTGGACCA 

GAGGAAGCAGGTTAAATGAGACCTGTCCTCTGCTGCACTCAGAAAAAATAGGCAGTCCCTGA 

TGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACCTAC CCACAATG CAGCCTAT ACT 
CCCAAGACTACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAAATAGTTTTC 
TCAACCATTTAAAAATATTCTTCTGAACTCATCAAAGTAGAAGAGCCCCCAACCTTTTCTCT 
CTGCCTTCAAGAAGGCAGACATTTGGTATGATTTAGCATCAACAACACATTTATGAGTATAT 
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FIGURE 195B 

GTAAGTAATCAGAGGGGCAAATGCCACTTGTTATTCCTCCCAAGTTTTCCAAGCAAGTACAC 
ACAGATCTCTGGTAGGATTAGGGGCCACTTGTGTTTCCGGCTTATTTTAGTCGACTTGTCAG 
CAAGTTTGATGCCTAGTCTATCTGACATGGCCCAGTAGAACAGGGCATTGATGGATCACATG 
AGATGGTAGAAGGAACATCATCACATACCCCTCTCACAGAGAAAATTATCAAAGAACCAGAA 
ATTATATCTGTTTTGGAGCAAGAGTGTCATAATGTTTCAGGGTAGTCAAAATAAACATAAAT 
TATCTCCTCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACAGAATGTCAAA 
TAAAAAGGAATTAGCTAGAATATGACCATTAAATGTGCTTCTGAAATATATTTTGAGATAGG 
TTTAGAATGTCA 
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FIGURE 196 

MDFLLLGLCLYWLLRRPSGWLCLLGACFQMLPAAPSGCPQLCRCEGRLLYCEALNLTEAPH 
NLSGLLGLSLRYNSLSELRAGQFTGLMQLTVfLYLDHNHICSVQGDAFQKLRRVKELTLSSNQ 
I TQLPNTTFRPMPNLR SVDLS YNKLQALAPDLFHGLRKLTTLHMRANA I QFVPVR I FQDCRS 
LKFLD I GYNQLKSLARNS FAGLFKLTELHLEHNDLVKVNFAHFPRL I SLHSLCLRRNKVAI V 
VSSLDWVWNLEKMDLSGNEIEYMEPHVFETVPHLQSLQLDSNRLTYIEPRILNSWKSLTSIT 
LAGNLWDCGRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDGAEPTSG 
HLLSAVTNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAENAVQIHKWTGTMA 
LI FSFL IVVLVLWSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQMAAMSAQEYYVDYKPNH 
IEGALVI INEYGSCTCHQQPARECEV 
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FIGURE 197 

GTGCAAGGAGCCGAGGCGAGATGGGCGTCCTGGGCCGGGTCCTGCTGTGGCTGCAGCTCTGC 
GCACTGACCCAGGCGGTCTCCAAACTCTGGGTCCCCAACACGGACTTCGACGTCGCAGCCAA 
CTGGAGCCAGAACCGGACCCCGTGCGCCGGCGGCGCCGTTGAGTTCCCGGCGGACAAGATGG 
TGTCAGTCCTGGTGCAAGAAGGTCACGCCGTCTCAGACATGCTCCTGCCGCTGGATGGGGAA 
CTCGTCCTGGCTTCAGGAGCCGGATTCGGCGTCTCAGACGTGGGCTCGCACCTGGACTGTGG 
CGCGGGCGAACCTGCCGTCTTCCGCGACTCTGACCGCTTCTCCTGGCATGACCCGCACCTGT 
GGCGCTCTGGGGACGAGGCACCTGGCCTCTTCTTCGTGGACGCCGAGCGCGTGCCCTGCCGC 
CACGACGACGTCTTCTTTCCGCCTAGTGCCTCCTTCCGCGTGGGGCTCGGCCCTGGCGCTAG 
CCCCGTGCGTGTCCGCAGCATCTCGGCTCTGGGCCGGACGTTCACGCGCGACGAGGACCTGG 
CTGTTTTCCTGGCGTCCCGCGCGGGCCGCCTACGCTTCCACGGGCCGGGCGCGCTGAGCGTG 
GGCCCCGAGGACTGCGCGGACCCGTCGGGCTGCGTCTGCGGCAACGCGGAGGCGCAGCCGTG 
GATCTGCGCGGCCCTGCTCCAGCCCCT 
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FIGURE 198 

MGVIjGRVLLWLQLCALTQAVSKLWPOT 

GHAVSDMLLPLDGELVLASGAGFGVSDVGSHLDCGAGEPAVFRDSDRFSWHDPHLWRSGDEA 
PGLFFVDAERVPCRHDDVFFPPSASFRVGLGPGASPVRVRS I SALGRTFTRDEDLAVFLASR 
AGRLRFHGPGALSVGPEDCADPSGCVCGNAEAQPWI CAALLQP 
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FIGURE 199 

ATCGCATCAATTGGGAGTACCATCTTCCTCAT6GGACCAGTGAAACAGCTGAAGCGAATGTT 

TGAGCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTT 

CTGCCTTTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCA 

TTGACGTGGTACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGC 

CGTGTGTCTTGCATAATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAG 

CTGGTGGAGAGTTTTGTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCT 

TGCAGCAATGTGTTGCTTGTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACAT 

TCTCGAACCTGAATGTCAGTAGCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAA 

TCTTCCTCATGTACCTGTTTCCTCTCTGGATGTTGTCCCACTGAATTCCCATGAATACAAAC 

CTATTCAGCAACAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 201 

TTGAGCGCAGGTGAGCTCCTGCGCGTTCCGGGGGCGTTCCTCCAGTCACCCTCCCGCCGTTA 
CCCGCGGCGCGCCCGAGGGAGTCTCCTCCAGACCCTCCCTCCCGTTGCTCCAAACTAATACG 
GACTGAACGGATCGCTGCGAGGGTGGGAGAGAAAATTAGGGGGAGAAAGGACAGAGAGAGCA 
ACTACCATCGATAGCCAGATAGATTATCTTACACTGAACTGATCAAGTACTTTGAAAATGAC 
TTCGAAATTTATCTTGGTGTCCTTCATACTTGCTGCACTGAGTCTTTCAACCACCTTTTCTC 
TCCAACTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCCGTTGGGATTACTTA 
TATAAAGTTCC7U\CGCCCCATTTTCATTATATTATGAAATATGGTGTTCACGTGAAGC^AGT 
TACTAATGTTTTTATTACAAAAACCTACCCT/^ACCATTATACTTTGGTAACTGGCCTCTTTG 
CAGAGAATCATGGGATTGTTGCAAATGATATGTTTGATCCTATTCGGAACAAATCTTTCTCC 
TTGGATCACATGAATATTTATGATTCCAAGTTTTGGGAAGAAGCGACACCAATATGGATCAC 
AAACCAGAGGGCAGGACATACTAGTGGTGCAGCCATGTGGCCCGGAACAGATGTAAAAATAC 
ATAAGCGCTTTCCTACTCATTACATGCCTTACAATGAGTCAGTTTCATTTGAAGATAGAGTT 
GCCAAAATTGTTGAATGGTTTACGTCAAAAGAGCCCATAAATCTTGGTCTTCTCTATTGGGA 
AGACCCTGATGACATGGGCCACCATTTGGGACCTGACAGTCCGCTCATGGGGCCTGTCATTT 
CAGATATTGACAAGAAGTTAGGATATCTCATACAAATGCTGAAAAAGGCAAAGTTGTGGAAC 
ACTCTGAACCTAATCATCACAAGTGATCATGGAATGACGCAGTGCTCTGAGGAAAGGTTAAT 
AGAACTTGACCAGTACCTGGATAAAGACCACTATACCCTGATTGATCAATCTCCAGTAGCAG 
CCATCTTGCCAAAAGAAGGT7VAATTTGATGAAGTCTATGAAGCACTAACTCACGCTCATCCT 
AATCTTACTGTTTAC^AAAAAGAAGACGTTCCAGAAAGGTGGCATTACAAATACAACAGTCG 
AATTCAACCAATCATAGCAGTGGCTGATGAAGGGTGGCACATTTTACAGAATAAGTCAGATG 
ACTTTCTGTTAGGCAACCACGGTTACGATAATGCGTTAGCAGATATGCATCCAATATTTTTA 
GCCCATGGTCCTGCCTTCAGAAAGAATTTCTCAAAAGAAGCCATGAACTCCACAGATTTGTA 
CCCACTACTATGCCACCTCCTCAATATCACTGCCATGCCACACAATGGATCATTCTGGAATG 
TCCAGGATCTGCTCAATTCAGCAATGCCAAGGGTGGTCCCTTATACACAGAGTACTATACTC 
CTCCCTGGTAGTGTTAAACCAGCAGAATATGACCAAGAGGGGTCATACCCTTATTTCATAGG 
GGTCTCTCTTGGCAGCATTATAGTGATTGTATTTTTTGTAATTTTCATTAAGCATTTAATTC 
ACAGTCAAATACCTGCCTTACAAGATATGCATGCTGAAATAGCTCAACCATTATTACAAGCC 
TAATGTTACTTTGAAGTGGATTTGCATATTGAAGTGGAGATTCCATAATTATGTCAGTGTTT 
AAAGGTTTCAAATTCTGGGAAACCAGTTCCAAACATCTGCAGAAACCATTAAGCAGTTACAT 
ATTTAGGTATACACACACACACACACACACACATACACACACACGGACCAAAATACTTACAC 
CTGCAAAGGAATAAAGATGTGAGAGTATGTCTCCATTGTTCACTGTAGCATAGGGATAGATA 
AGATCCTGCTTTATTTGGACTTGGCGCAGATAATGTATATATTTAGCAACTTTGCACTATGT 
AAAGTACCTTATATATTGCACTTTAAATTTCTCTCCTGATGGGTACTTTAATTTGAAATGCA 
CTTTATGGACAGTTATGTCTTATAACTTGATTGAAAATGACAACTTTTTGCACCCATGTCAC 
AGAATACTTGTTACGCATTGTTCAAACTGAAGGAAATTTCTAATAATCCCGAATAATGAACA 
TAGAAATCTATCTCCATAAATTGAGAGAAGAAGAAGGTGATAAGTGTTGAAAATTAAATGTG 
ATAACCTTTGAACCTTGAATTTTGGAGATGTATTCCCAACAGCAGAATGCAACTGTGGGCAT 
TTCTTGTCTTATTTCTTTCCAGAGAACGTGGTTTTCATTTATTTTTCCCTCAAAAGAGAGTC 
AAATACTGACAGATTCGTTCTAAATATATTGTTTCTGTCATAAAATTATTGTGATTTCCTGA 
TGAGTCATATTACTGTGATTTTCATAATAATGAAGACACCATGAATATACTTTTCTTCTATA 
TAGTTCAGCAATGGCCTGAATAGAAGCAACCAGGCACCATCTCAGCAATGTTTTCTCTTGTT 
TGTAATTATTTGCTCCTTTGAAAATTAAATCACTATTAATTACATTAAAAATCAAATTGGAT 
AAAAAAAAAAAAAAAAAAA 
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OTSKFILVSFIIiAALSLSTTFSLQLDQQKVLLVSFDGFRWDYLYKVPTPHFHYIMKYGVHVK 
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ITNQRAGHTSG/^AMWPGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEPIl^GLLY 
WEDPDDMGHHLGPDSPmGPVISDIDKKLGYLIQl^KKAKLWNTLNLIITSDHGMTQCSEER 
L I ELDQ YLDKDHYTL I DQS P VAAI LP KEGKFDE VYE ALTHAH PNLTVYKKEDVPERWHYKYN 
SR IQP 1 1 AVADEGWHI LQNKSDDFLLGNHGYDNALADMHP I FLAHGPAFRKNFSKEAMNSTD 
LYPLLCHLlliNITAMPHNGSFWNVQDLLNSAMPRWPYTQSTILLPGSVKPAEYDQEGSYPYF 
IGVSLGS 1 1 VI VFFVI F I KHLIHSQI PALQDMHAE I AQPLLQA 
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FIGURE 203 

GGATTTTTGTGATCCGCGATTCGCTCCCACGGGCGGGACCTTTGTAACTGCGGGAGGCCCAG 

GACAGGCCCACCCTGCGGGGCGGGAGGCAGCCGGGGTGAGGGAGGTGAAGAAACCAAGACGC 

AGAGAGGCCAAGCCCCTTGCCTTGGGTCACACAGCCAAAGGAGGCAGAGCCAGAACTCACAA 

CCAGATCCAGAGGCAACAGGGACATGGCCACCTGGGACGAAAAGGCAGTCACCCGCAGGGCC 

AAGGTGGCTCCCGCTGAGAGGATGAGCAAGTTCTTAAGGCACTTCACGGTCGTGGGAGACGA 

CTACCATGCCTGGAACATCAACTACAAGAAATGGGAGAATGAAGAGGAGGAGGAGGAGGAGG 

AGCAGCCACCACCCACACCAGTCTCAGGCGAGGAAGGCAGAGCTGCAGCCCCTGACGTTGCC 

CCTGCCCCTGGCCCCGCACCCAGGGCCCCCCTTGACTTCAGGGGCATGTTGAGGAAACTGTT 

CAGCTCCCACAGGTTTCAGGTCATCATCATCTGCTTGGTGGTTCTGGATGCCCTCCTGGTGC 

TTGCTGAGCTCATCCTGGACCTGAAGATCATCCAGCCCGACAAGAATAACTATGCTGCCATG 

GTATTCCACTACATGAGCATCACCATCTTGGTCTTTTTTATGATGGAGATCATCTTTAAATT 

ATTTGTCTTCCGCCTGAGTTCTTTCACCACAAGTTTGAGATCCTGGATGCCCGTCGTGGTGG 

TGGTCTCATTCATCCTGGACATTGTCCTCCTGTTCCAGGAGCACCAGTTTGAGGCTCTGGGC 

CTGCTGATTCTGCTCCGGCTGTGGCGGGTGGCCCGGATCATCAATGGGATTATCATCTCAGT 

TAAGACACGTTCAGAACGGCAACTCTTAAGGTTAAAACAGATGAATGTACAATTGGCCGCCA 

AGATTCAACACCTTGAGTTCAGCTGCTCTGAGAAGCCCCTGGACTGATGAGTTTGCTGTATC 

AACCTGTAAGGAGAAGCTCTCTCCGGATGGCTATGGGAATGAAAGAATCCGACTTCTACTCT 

CACACAGCCACCGTGAAAGTCCTGGAGTAAAATGTGCTGTGTACAGAAGAGAGAGAAGGAAG 

CAGGCTGGCATGTTCACTGGGCTGGTGTTACGACAGAGAACCTGACAGTCACTGGCCAGTTA 

TCACTTCAGATTACAAATCACACAGAGCATCTGCCTGTTTTCAATCACAAGAGAACAAAACC 

AAAATCTATAAAGATATTCTGAAAATATGACAGAATTTGACAAATAAAAGCATAAACGTGTA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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SGEEGRAAAPDVAPAPGPAPRAPLDFRGMLRKLFSSHRFQVI 1 1 CLWLD ALLVLAEL I LDL 
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VLLFQEHQFEALGLLILLRLWRVARI INGI 1 1 SVKTRSERQLLRLKQMNVQLAAKIQHLEFS 
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CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACAT 
GCTCTGTCTGTGCCTGTACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTG 
AGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCCATTTTCAAGCTCAGTGTCTTCATCCCCTCC 
CAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCTGGAGATAAGGACCT 
TGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGA 
GGCTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATG 
CAGTCCCTGCGGGACTTGGGAGTCAAGATATCTGAACAGCAGGCAGAAAAAATTCTCAAGAG 
CATGGATAAAAACGGCACGATGACCATCGACTGGAACGAGTGGAGAGACTACCACCTCCTCC 
ACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACGATCTTTGATGTG 
GGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTG 
GAGACACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGG 
ACAGGCTCAAGGTGCTCATGCAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGT 
GGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGGTCACTCTGGCGGGGCAATGGCATCAA 
CGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAGATCAAGCGCC 
TTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCA 
GGGGCCATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCG 
GAAGACAGGCCAGTACTCAGGAATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGG 
TGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTGGGCATCATCCCCTATGCCGGCATC 
GACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTGAACAGCGC 
GGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGG 
CCAGCTACCCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCT 
CCGGAGGTGACCATGAGCAGCCTCTTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCT 
GTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCAGCTGTGAGCATCAGCTACGTGG 
TCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGG TGA CGGGGGGAGGGCCGCCCG 
GCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATG 
TGCCAACACTAAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGT 
GGGGAGAGCTGGCAGGCCCAGGGCTTGTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCC 
AGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGGCTCCGGGCTCACATGTGTAA 
GGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCTTAGT 
TCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGGCGTGC 
ATCTCCCTGTGCCCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAG 
CCCACATCCCACCCCCTCGTCCAATCCCATAATCCATGATGAAAGGTGAGGTCACGTGGCCT 
CCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGCTGTGAAGGAAGAGGAAAG 
GATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATGCT 
TGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCA 
TGGTGCTCTGAGCTGGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTG 
TCCCCACTGTGGCATGAGGGCAGTGGAGCACCATGTTTGAGGGCGAAGGGCAGAGCGTTTGT 
GTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGGACTGTTGGGAAAAGGG 
TTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACC 
CCAGCAGGGGCGCAGCGGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTT 
ATTTTGTATTTATTTGAACAGAGTTATGTCCTAACTATTTTTATAGATTTGTTTAATTAATA 
GCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTTGATTGTACCTTCCC 
AAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACAT 
CTGTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTG 
GCTCCTTTCCTTTGGCAGGTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGA 
CTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAATAACCTTGAAGGTGGAATCCAGTTATTTC 
CTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAGTGAGGTGCCTCT 
CACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCC 
CAGCCTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCC 
ATTCCACCAGAATGACCTGATGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATT 
GTTATATATGAACATATAACTGGAGTCGTCAAAAAGCAAATTAAGAAAGAATTGGACGTTAG 
AAGTTGTCATTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 206 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKS IFKLSVFI PSQEFSTYRQWKQKIVQAGDKD 
LDGQLDFEEFVHYLQDHEKKLRL^ 

SMDKNGTMT I DWNE WRD YHLLH P VEN I PE 1 1 LYWKHST I FDVGENLTVPDE FTVEERQTGMW 

WRHLVAGGGAGAVSRTCTAPIJDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARS 

NVLK I APES AI KFMAYEQ I KRLVGSDQETLRIHERLVAGSLAGAI AQSS I YPME VLKTRMAL 

RKTGQYSGMLDCARR I LAREGVAAFYKG YVPNMLG 1 1 P YAG I DLAVYETLKNAWLQHYAVNS 

ADPGVFVLLACGTMSSTCGQIASYPLALWTRMQAQASIEGAPEVTMSSLFKHILRTEGA^ 

LYRGLAPNFMKVI PAVS 1 S YWYENLK I TLGVQSR 
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FIGURE 207 

(X3AAGGCAGCGGCAGCTCCACT 

TCCTTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAG 
CAATTGC^CTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTC 
GCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAA 
ACTTTCTGATATCGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCA 
AAGAAGGCAAAGATGAGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTT 
GCTGATCAAGTGATAGTTGGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGC 
TGGCACCTACAAATGTTATATCATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATA 
AAACTGGAGCCTTCAGCATGCCGGAAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTG 
CGGTGTGAGGCTCCCCGATGGTTCCCCCAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCA 
GGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAGCTGAACTCTGAGAATGTGACCA 
TGAAGGTTGTGTCTGTGCTCTACAATGTTACGATCAACAACACATACTCCTGTATGATTGAA 
AATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAGAATCGGAGATCAAAAGGCGGAG 
TCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGTCTCTTCTTTCTTTGCCATCAGCT 
GGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAATGTGCCTTGGCCACAAAAAAG 
CATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCACCACCAGATATGACCTAG 
TTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTGAGCAAACAAGAGCA 
AGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAATCTATCTTCAAA 
GACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAG 
TAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCT 
GGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGCTG 
TAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCA 
CAAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 
GGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCT 
TGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAA 
ACAGAGCAGTCGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 208 

MASLGQILFWSIISIIIIIJVGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDI 
KLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTD 
AGTYKCYIITSKGKGNANLEYKTGAFSMPEVNVD 

QGANFSEVSNTSFELNSENVTMKWS VL YNVT INNTYS CM I END I AKATGD I KVTESE I KRR 
SHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK 
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FIGURE 209 

GAATTTGTAGAAGACAGCGGCGTTGCCATGGCGGCGTCTCTGGGGCAGGTGTTGGCTCTGGT 
GCTGGTGGCCGCTCTGTGGGGTGGCACGCAGCCGCTGCTGAAGCGGGCCTCCGCCGGCCTGC 
AGCGGGTTCATGAGCCGACCTGGGCCCAGCAGTTGCTACAGGAGATGAAGACCCTCTTCTTG 
AATACTGAGTACCTGATGCCCTTTCTCCTCAACCAGTGTGGATCCCTTCTCTATTACCTCAC 
CTTGGCATCGACAGATCTGACCCTGGCTGTGCCCATCTGTAACTCTCTGGCTATCATCTTCA 
CACTGATTGTTGGGAAGGCCCTTGGAGAAGATATTGGTGGAAAACGTAAGTTAGACTACTGC 
GAGTGCGGGACGCAGCTCTGTGGATCTCGACATACCTGTGTTAGTTCCTTCCCAGAACCCAT 
CTCCCCAGAGTGGGTGAGGACACGGCCTTTTCCCATCCTGCCCTTTCCTCTGCAGCTGTTTT 
GCTTCCTTGTGGCCATCAGAGTTCCCTTCCCCTGGACAGTCTGGAGAAAGACAGAGGCTGGG 
GTTTGGGATTGAAGACCAGACCCCATCTGAGCCCTTCCTCCAGCCCTGTACCAGCTCCTACT 
GGCATGGCTGAGCTCAGACCCTCCTGATTTCTGCCTATTATCCCAGGAGCAGTTGCTGGCAT 
GGTGCTCACCGTGATAGGAATTTCACTCTGCATCACAAGCTCAGTGAGTAAGACCCAGGGGC 
AACAGTCTACCCTTTGAGTGGGCCGAACCCACTTCCAGCTCTGCTGCCTCCAGGAAGCCCCT 
GGGCCATGAAGTGCTGGCAGTGAGCGGATGGACCTAGCACTTCCCCTCTCTGGCCTTAGCTT 
CCTCCTCTCTTATGGGGATAACAGCTACCTCATGGATCACAATAAGAGAACAAGAGTGAAAG 
AGTTTTGTAACCTTCAAGTGCTGTTCAGCTGCGGGGATTTAGCACAGGAGACTCTACGCTCA 
CCCTCAGCAACCTTTCTGCCCCAGCAGCTCTCTTCCTGCTAACATCTCAGGCTCCCAGCCCA 
GCCACCATTACTGTGGCCTGATCTGGACTATCATGGTGGCAGGTTCCATGGACTGCAGAACT 
CCAGCTGCATGGAAAGGGCCAGCTGCAGACTTTGAGCCAGAAATGCAAACGGGAGGCCTCTG 
GGACTCAGTCAGAGCGCTTTGGCTGAATGAGGGGTGGAACCGAGGGAAGAAGGTGCGTCGGA 
GTGGCAGATGCAGGAAATGAGCTGTCTATTAGCCTTGCCTGCCCCACCCATGAGGTAGGCAG 
AAATCCTCACTGCCAGCCCCTCTTAAACAGGTAGAGAGCTGTGAGCCCCAGCCCCACCTGAC 
TC CAGC AC AC CTGG CGAG TAGT AGCTGTCAATAAATCT ATGTAAACAG AC AAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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MAASLGQVLALVLVAALWGGTQPLLKRASAGLQRVHEPTWAQQLLQEM 
LNQCGSLLYYLTLASTDLTI^ 

RHTCVSSFPEPISPEWWTRPFPILPFPLQLFCFLVAIRVPFPWTVWRKTEAGVWD 
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FIGURE 211 

CTTCTGTAGGACAGTCACCAGGCCAGATCCAGAAGCCTCTCTAGGCTCCAGCTTTCTCTGTG 
GAAGATGACAGCAATTATAGCAGGACCCTGCCAGGCTGTCGAAAAGATTCCGCAATAAAACT 
TTGCCAGTGGGAAGTACCTAGTGAAACGGCCTAAGATGCCACTTCTTCTCATGTCCCAGGCT 
TGAGGCCCTGTGGTCCCCATCCTTGGGAGAAGTCAGCTCCAGCACC ATG AAGGGCATCCTCG 
TTGCTGGTATCACTGCAGTGCTTGTTGCAGCTGTAGAATCTCTGAGCTGCGTGCAGTGTAAT 
TCATGGGAAAAATCCTGTGTCAACAGCATTGCCTCTGAATGTCCCTCACATGCCAACACCAG 
CTGTATCAGCTCCTCAGCCAGCTCCTCTCTAGAGACACCAGTCAGATTATACCAGAATATGT 
TCTGCTCAGCGGAGAACTGCAGTGAGGAGACACACATTACAGCCTTCACTGTCCACGTGTCT 
GCTGAAGAACACTTTCATTTTGTAAGCCAGTGCTGCCAAGGAAAGGAATGCAGCAACACCAG 
CGATGCCCTGGACCCTCCCCTGAAGAACGTGTCCAGCAACGCAGAGTGCCCTGCTTGTTATG 
AATCTAATGGAACTTCCTGTCGTGGGAAGCCCTGGAAATGCTATGAAGAAGAACAGTGTGTC 
TTTCTAGTTGCAGAACTTAAGAATGACATTGAGTCTAAGAGTCTCGTGCTGAAAGGCTGTTC 
CAACGTCAGTAACGCCACCTGTCAGTTCCTGTCTGGTGAAAACAAGACTCTTGGAGGAGTCA 
TCTTTCGAAAGTTTGAGTGTGCAAATGTAAACAGCTTAACCCCCACGTCTGCACCAACCACT 
TCCCACAACGTGGGCTCCAAAGCTTCCCTCTACCTCTTGGCCCTTGCCAGCCTCCTTCTTCG 
GGGACTGCTGCCCTGAGGTCCTGGGGCTGCACTTTGCCCAGCACCCCATTTCTGCTTCTCTG 
AGGTCCAGAGCACCCCCTGCGGTGCTGACACCCTCTTTCCCTGCTCTGCCCCGTTTAACTGC 
CCAGTAAGTGGGAGTCACAGGTCTCCAGGCAATGCCGACAGCTGCCTTGTTCTTCATTATTA 
AAGCACTGGTTCATTCACTGCCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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^4KGILVAGITAVLVAAVESLSC^QCNSWEKSCWSIASECPSHANTSCISSSASSSLETPW 
LYQNMFCSAENCSEETHITAFTVHVSAEEHFHFVSQCCQGKECSNTSDALDPPLKNVSSNAE 
CPACYESNGTS CRGKPWKCYEEEQCVFL VAELKND I ES KS LVLKGCSNVSNATCQFLSGENK 
TLGGVIFRKFECANVNSLTPTSAPTTSHWGSKASLYLLALASLLLRGLLP 
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FIGURE 213 

GGCCTCGGTTCAAACGACCCGGTGGGTCTACAGCGGAAGGGAGGGAGCGAAGGTAGGAGGCA 
GGGCTTGCCTCACTGGCC^CCCTCCC^CCCCAAGAGCCCAGCCCCATTCTCCCCGCCGCCG 
GCGCGCTGCTGTGGGTCCTGCTGCTGAATCTGGGTCCCCGGGCGGCGGGGGCCCAAGGCCTG 
ACCCAGACTCCGACCGAAATGCAGCGGGTCAGTTTACGCTTTGGGGGCCCCATGACCCGCAG 
CTACCGGAGCACCGCCCGGACTGGTCTTCCCCGGAAGACAAGGATAATCCTAGAGGACGAGA 
ATGATGCCATGGCCGACGCCGACCGCCTGGCTGGACCAGCGGCTGCCGAGCTCTTGGCCGCC 
ACGGTGTCCACCGGCTTTAGCCGGTCGTCCGCCATTAACGAGGAGGATGGGTCTTCAGAAGA 
GGGGGTTGTGATTAATGCCGGAAAGGATAGCACCAGCAGAGAGCTTCCCAGTGCGACTCCCA 
ATACAGCGGGGAGTTCCAGCACGAGGTTTATAGCCAATAGTCAGGAGCCTGAAATCAGGCTG 
ACTTCAAGCCTGCCGCGCTCCCCCGGGAGGTCTACTGAGGACCTGCCAGGCTCGCAGGCCAC 
CCTGAGCCAGTGGTCCACACCTGGGTCTACCCCGAGCCGGTGGCCGTCACCCTCACCCACAG 
CCATGCCATCTCCTGAGGATCTGCGGCTGGTGCTGATGCCCTGGGGCCCGTGGCACTGCCAC 
TGCAAGTCGGGCACCATGAGCCGGAGCCGGTCTGGGAAGCTGCACGGCCTTTCCGGGCGCCT 
TCGAGTTGGGGCGCTGAGCCAGCTCCGCACGGAGCACAAGCCTTGCACCTATCAACAATGTC 
CCTGCAACCGACTTCGGGAAGAGTGCCCCCTGGACACAAGTCTCTGTACTGACACCAACTGT 
GCCTCTCAGAGCACCACCAGTACCAGGACCACCACTACCCCCTTCCCCACCATCCACCTCAG 
AAGCAGTCCCAGCCTGCCACCCGCCAGCCCCTGCCCAGCCCTGGCTTTTTGGAAACGGGTCA 
GGATTGGCCTGGAGGATATTTGGAATAGCCTCTCTTCAGTGTTCACAGAGATGCAACCAATA 
GACAGAAACCAGAGGTAATGGCCACTTCATCCACATGAGGAGATGTCAGTATCTCAACCTCT 
CTTGCCCTTTCAATCCTAGCACCCACTAGATATTTTTAGTACAGAAAAACAAAACTGGAAAA 
CACAA 
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FIGURE 214 

IWPAAGALLWVLLLNLGPRAAGAQGLTQTPTEMQRVSLRFGGPMTRSYRSTARTGLPRKTRI 
ILEDENDAMADADRLAGPAAAELLAATVSTGFSRSSAINEEDGSSEEGVVINAGKDSTSREL 
PSATPNTAGSSSTRFIANSQEPEIRLTSSLPRSPGRSTEDLPGSQATLSQWSTPGSTPSRWP 
SPSPTAMPSPEDLRLVLMPWGPWHCHCKSGTM 

TYQQCPCNRLREECPLDTSLCTDTNCASQSTTSTRTTTTPFPTIHLRSSPSLPPASPCPALA 
FWKR VR I GLED I WNS LS S VFT EMQ P I DRNQR 
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CCCGGGTCGACCCACGCGTCCGGGGAGAAAGGATGGCCGGCCTGGCGGCGCGGTTGGTCCTG 
CTAGCTGGGGCAGCGGCGCTGGCGAGCGGCTCCCAGGGCGACCGTGAGCCGGTGTACCGCGA 
CTGCGTACTGCAGTGCGAAGAGCAGAACTGCTCTGGGGGCGCTCTGAATCACTTCCGCTCCC 
GCCAGCCAATCTACATGAGTCTAGCAGGCTGGACCTGTCGGGACGACTGTAAGTATGAGTGT 
ATGTGGGTCACCGTTGGGCTCTACCTCCAGGAAGGTCACAAAGTGCCTCAGTTCCATGGC^ 
GTGGCCCTTCTCCCGGTTCCTGTTCTTTCAAGAGCCGGCATCGGCCGTGGCCTCGTTTCTCA 
ATGGCCTGGCCAGCCTGGTGATGCTCTGCCGCTACCGCACCTTCGTGCCAGCCTCCTCCCCC 
ATGTACCACACCTGTGTGGCCTTCGCCTGGGTGTCCCTCAATGCATGGTTCTGGTCCACAGT 
CTTCCAGACCAGGGACACTGACCTCACAGAGAAAATGGACTACTTCTGTGCCTCCACTGTCA 
TCCTACACTCAATCTACCTGTGCTGCGTCAGGACCGTGGGGCTGCAGCACCCAGCTGTGGTC 
AGTGCCTTCCGGGCTCTCCTGCTGCTCATGCTGACCGTGCACGTCTCCTACCTGAGCCTCAT 
CCGCTTCGACTATGGCTACAACCTGGTGGCCAACGTGGCTATTGGCCTGGTCAACGTGGTGT 
GGTGGCTGGCCTGGTGCCTGTGGAACCAGCGGCGGCTGCCTCACGTGCGCAAGTGCGTGGTG 
GTGGTCTTGCTGCTGCAGGGGCTGTCCCTGCTCGAGCTGCTTGACTTCCCACCGCTCTTCTG 
GGTCCTGGATGCCCATGCCATCTGGCACATCAGCACCATCCCTGTCCACGTCCTCTTTTTCA 
GCTTTCTGGAAGATGACAGCCTGTACCTGCTGAAGGAATCAGAGGACAAGTTCAAGCTGGAC 
TGAAGACCTTGGAGCGAGTCTGGCCCAGTGGGGATCCTGCCCCCGCCCTGCTGGCCTCCCTT 
CTCCCCTCAACCCTTGAGATGATTTTCTCTTTTCAACTTCTTGAACTTGGACATGAAGGATG 
TGGGCCCAGAATCATGTGGCCAGCCCACCCCCTGTTGGCCCTCACCAGCCTTGGAGTCTGTT 
CTAGGGAAGGCCTCCCAGCATCTGGGACTCGAGAGTGGGCAGCCCCTCTACCTCCTGGAGCT 
GAACTGGGGTGGAACTGAGTGTGTTCTTAGCTCTACCGGGAGGACAGCTGCCTGTTTCCTCC 
CCACCAGCCTCCTCCCCACATCCCCAGCTGCCTGGCTGGGTCCTGAAGCCCTCTGTCTACCT 
GGGAGACCAGGGACCACAGGCCTTAGGGATACAGGGGGTCCCCTTCTGTTACCACCCCCCAC 
CCTCCTCCAGGACACCACTAGGTGGTGCTGGATGCTTGTTCTTTGGCCAGCCAAGGTTCACG 
GCGATTCTCCCCATGGGATCTTGAGGGACCAAGCTGCTGGGATTGGGAAGGAGTTTCACCCT 
GACCGTTGCCCTAGCCAGGTTCCCAGGAGGCCTCACCATACTCCCTTTCAGGGCCAGGGCTC 
CAGCAAGCCCAGGGCAAGGATCCTGTGCTGCTGTCTGGTTGAGAGCCTGCCACCGTGTGTCG 
GGAGTGTGGGCCAGGCTGAGTGCATAGGTGACAGGGCCGTGAGCATGGGCCTGGGTGTGTGT 
GAGCTCAGGCCTAGGTGCGCAGTGTGGAGACGGGTGTTGTCGGGGAAGAGGTGTGGCTTCAA 
AGTGTGTGTGTGCAGGGGGTGGGTGTGTTAGCGTGGGTTAGGGGAACGTGTGTGCGCGTGCT 
GGTGGGCATGTGAGATGAGTGACTGCCGGTGAATGTGTCCACAGTTGAGAGGTTGGAGCAGG 
ATGAGGGAATCCTGTCACCATCAATAATCACTTGTGGAGCGCCAGCTCTGCCCAAGACGCCA 
CCTGGGCGGACAGCCAGGAGCTCTCCATGGCCAGGCTGCCTGTGTGCATGTTCCCTGTCTGG 
TGCCCCTTTGCCCGCCTCCTGCAAACCTCACAGGGTCCCCACACAACAGTGCCCTCCAGAAG 
CAGCCCCTCGGAGGCAGAGGAAGGAAAATGGGGATGGCTGGGGCTCTCTCCATCCTCCTTTT 
CTCCTTGCCTTCGCATGGCTGGCCTTCCCCTCCAAAACCTCCATTCCCCTGCTGCCAGCCCC 
TTTGCCATAGCCTGATTTTGGGGAGGAGGAAGGGGCGATTTGAGGGAGAAGGGGAGAAAGCT 
TATGGCTGGGTCTGGTTTCTTCCCTTCCCAGAGGGTCTTACTGTTCCAGGGTGGCCCCAGGG 
CAGGCAGGGGCCACACTATGCCTGTGCCCTGGTAAAGGTGACCCCTGCCATTTACCAGCAGC 
CCTGGCATGTTCCTGCCCCACAGGAATAGAATGGAGGGAGCTCCAGAAACTTTCCATCCCAA 
AGGCAGTCTCCGTGGTTGAAGCAGACTGGATTTTTGCTCTGCCCCTGACCCCTTGTCCCTCT 
TTGAGGGAGGGGAGCTATGCTAGGACTCCAACCTCAGGGACTCGGGTGGCCTGCGCTAGCTT 
CTTTTGATACTGAAAACTTTTAAGGTGGGAGGGTGGCAAGGGATGTGCTTAATAAATCAATT 
CCAAGCCTCAAAAAAAAAAAAAAAAA 
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FIGURE 216 

MAGLAARLVLLAGAAAIASGSQGDREPVYRDOnjQCEEQNCSGGALNHFRSRQPIYMSLAGW 
TCRDDCKYECMWVTVGLYLQEGHKVPQFHGKWPFSRFLFFQEPASAVASFLNGLASLVMLCR 
YRTFVPASSPMYHTCVAFAWVSLNAWFWSTVFHTRDTDLTEKMDYFCASTVILHSIYLCCVR 
WGLQHPAWSAFRALLLLMLTVHVSYLSLIRFDYGYNLVANVAIGLVNVVWWLAWCLWNQR 
RLPHVRKCWWLLLQGLSLLELLDF PPLFWVLDAHAI WH I STI PVHVLFFSFLEDDSLYLL 
KESEDKFKLD 
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FIGURE 217 

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGCCGGAGGCCGAAGGTCCGTGA 
CTAT^CTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGGGCTG 
CTGATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTGAC 
CAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAGGAGC 
TGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAGCCAGGG 
CAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGAGAGGCAAA 
ACTCCAATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCTGGATATCAACACCA 
ACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAGGGGGCAGAGATG 
GAGAGTTCAAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCGGCTCTTCCGCCCCATTGA 
GGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACTGACATGCAGATCATGG 
TACGGCTGATCAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAAGAGAAGATTGCTGCGCTC 
TTTGATCTTGT^ATATTATGTCCATCAGATGGACAATGCGCAGGACCTGCTTTCCTTTGGTGG 
TCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCCCTCGTGAAGGAGTATGCTGCGT 
TTGTGCTGGGCGCTGCCTTTTCCAGCAACCCCAAGGTCCAGGTGGAGGCCATCGAAGGGGGA 
GCCCTGCAGAAGCTGCTGGTCATCCTGGCCACGGAGCAGCCGCTCACTGCAAAGAAGAAGGT 
CCTGTTTGCACTGTGCTCCCTGCTGCGCCACTTCCCCTATGCCCAGCGGCAGTTCCTGAAGC 
TCGGGGGGCTGCAGGTCCTGAGGACCCTGGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTG 
CGCGTGGTCACACTGCTCTACGACCTGGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGA 
GCTGACCCAGGAGATGTCCCCAGAGAAGCTGCAGCAGTATCGCCAGGTACACCTCCTGCCAG 
GCCTGTGGGAACAGGGCTGGTGCGAGATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGAT 
GCCCGTGAGAAGGTGCTGCAGACACTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCG 
TCAGGACCCCCAGCTCGGCAGGACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCA 
GCCTGGAGCTGCAGGATGGTGAGGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAAC 
AGCTTGCTGAAGGAGCTGAGATGAGGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGA 
GGCTGAGGGGTGCCAGCGTGGGTGGGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCT 
TGGCCATTAAATGGAAACCTGAAGGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAi^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MAPQSLPSSRMAPLGMLLGLLMAACFTFCL^ 

DAEVLEWHPTHEWQALQPGQAVPAGSHWIJSFLQTGEREAKLQYEDKFRNNLKGKRLDINTN 
TYTSQDLKSALAKFKEGAEMESSKEDKARQAEVKRi 

RLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQWINGLNSTEPLVKEYAAF 
VLGAAFSSNPKVQVEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHFPYAQRQFLKL 
GGLQVLRTLVQEKGTEVIAWVVTLLYDLVTEKMFAEEEAELTQEMSPEKiQQYRQVHLLPG 
LWEQGWCEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQIX^RTLASLQAEYQVLAS 
LELQDGEDEGYFQELLGSVNSLLKELR 
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FIGURE 219 

TTCGGCTTCCGTAGAGGAAGTGGCGCGGACCTTCATTTGGGGTTTCGGTTCCCCCCCTTCCC 

CTTCCCCGGGGTCTGGGGGTGACATTGCACCGCGCCCCTCGTGGGGTCGCGTTGCCACCCCA 

CGCGGACTCCCCAGCTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCC 

TTCCCACCTGACCAGCCAT^GGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGC 

CCGGCCTTCGCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGT 

CGCAGGGGCATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGG 

TCCATGTGACCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCT 

GTCTCTGTCCTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGA 

TGAAGGGTTAGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCT 

ATGTTTCTGGTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCT 

GATGCACTTGGGCCAGGTGTGGTTGGGATCCATGGAGACTCACCCTATTACTTCCTGACTTC 

AGCCTTTCTGACAGCAGCCATTATCCTGCTCCATACCTTTTGGGGAGTTGTGTTCTTTGATG 

CCTGTGAGAGGAGACGGTACTGGGCTTTGGGCCTGGTGGTTGGGAGTCACCTACTGACATCG 

GGACTGACATTCCTGAACCCCTGGTATGAGGCCAGCCTGCTGCCCATCTATGCAGTCACTGT 

TTCCATGGGGCTCTGGGCCTTCATCACAGCTGGAGGGTCCCTCCGAAGTATTCAGCGCAGCC 

TCTTGTGTAAGGACTGACTACCTGGACTGATCGCCTGACAGATCCCACCTGCCTGTCCACTG 

CCCATGACTGAGCCCAGCCCCAGCCCGGGTCCATTGCCCACATTCTCTGTCTCCTTCTCGTC 

GGTCTACCCCACTACCTCCAGGGTTTTGCTTTGTCCTTTTGTGACCGTTAGTCTCTAAGCTT 

TACCAGGAGCAGCCTGGGTTCAGCCAGTCAGTGACTGGTGGGTTTGAATCTGCACTTATCCC 

CACCACCTGGGGACCCCCTTGTTGTGTCCAGGACTCCCCCTGTGTCAGTGCTCTGCTCTCAC 

CCTGCCCAAGACTCACCTCCCTTCCCCTCTGCAGGCCGACGGCAGGAGGACAGTCGGGTGAT 

GGTGTATTCTGCCCTGCGCATCCCACCCGAGGACTGAGGGAACCTAGGGGGGACCCCTGGGC 

CTGGGGTGCCCTCCTGATGTCCTCGCCCTGTATTTCTCCATCTCCAGTTCTGGACAGTGCAG 

GTTGCCAAGAAAAGGGACCTAGTTTAGCCATTGCCCTGGAGATGAAATTAATGGAGGCTCAA 

GGATAGATGAGCTCTGAGTTTCTCAGTACTCCCTCAAGACTGGACATCTTGGTCTTTTTCTC 

AGGCCTGAGGGGGAACCATTTTTGGTGTGATAAATACCCTAAACTGCCTTTTTTTCTTTTTT 

GAGGTGGGGGGAGGGAGGAGGTATATTGGAACTCTTCTAACCTCCTTGGGCTATATTTTCTC 

TCCTCGAGTTGCTCCTCATGGCTGGGCTCATTTCGGTCCCTTTCTCCTTGGTCCCAGACCTT 

GGGGGAAAGGAAGGAAGTGCATGTTTGGGAACTGGCATTACTGGAACTAATGGTTTTAACCT 

CCTTAACCACCAGCATCCCTCCTCTCCCCAAGGTGAAGTGGAGGGTGCTGTGGTGAGCTGGC 

CACTCCAGAGCTGCAGTGCCACTGGAGGAGTCAGACTACCATGACATCGTAGGGAAGGAGGG 

GAGATTTTTTTGTAGTTTTTAATTGGGGTGTGGGAGGGGCGGGGAGGTTTTCTATAAACTGT 

ATCATTTTCTGCTGAGGGTGGAGTGTCCCATCCTTTTAATCAAGGTGATTGTGATTTTGACT 

AATAAAAAAGAATTTGTAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MGAAWFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFF'WLVSLLLASVVWFILVHVTDR 
SDARLQYGLL I FGAAVS VLLQE VFRFAY YKLLKKADEGLASLSEDGRS P I S I RQMAYVSGLS 
FGI ISGVFSVINILADALGPGWGIHGDSPYYFLTSAFLTAAI ILLHTFWGWFFDACERRR 
YWALGLWGSHLLTSGLTFLNPWYEASLLP I YAVTVSMGLWAF I TAGGSLRS IQRSLLCKD 
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AAGCTGGTTTAAGGAAGCAGAGGAGGGTTAGATTCGTTGAGTGAGGACGGAAGATCAACCCA 
TTTCCATTCCGCCAGATGGCCTATGTTTCTGGTCTCTCCCTTCGGNATCATCAGTGGTGTNT 
TNTCTGTTATCAATATTTTGGCTGATGCANTTGGGCCAGGTGTGGTTGGGATCCATGGAGAC 
TCACCCTATTANTTCCTGANTTCAGCCTTTNTGACAGCAGCCATTATCCTGCTC 
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FIGURE 222 

GACCGACCGTTCAGATGCCCGGTTCCAGTACGGCTTCCTGATTTTTGGTGCTGCTGTNTCTG 
TCCTTCTACAGGAGGTGTTCCGCTTTGCCTANTACAAGCTGCTTAAGAAGGCAGATGAGGGG 
TTAGCATNGCTGAGTGAGGACGGAAGATCACCCATTTCCATCCGCCAGATGGCCTATGTTTN 
TGGTNTTTCCTTCGGTATCATCAGTGGTGTTTTNTCTGTTATC^TATTTTGGNTGATGCAN 
TTGGGCCAGGTGTGGTTGGGATCCATGGAGANTCACCCTATTAATTCCTGAATTCAGCCTTT 
NTGACAGCAGCCATTATCCTGNTCCATACCTTTTGGGGAGTTGTGTTTTTTGATGCCTGTGA 
GAGGAG 
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FIGURE 223 

NGTTGGAGAAGTGGCGCGGACNTTCATTTGGGGTTTCGGTTTCCCCCCTTTCCCTTTCCCCG 
GGGTCTGGGGTGACATTGCACGGGCCCCTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCC 
CCAGJSTTGGNGCGCCCTTCCCATTTGCCTGTCCTGGT^GGCCCCCACCCCCCTTCCCACNTG 
ACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGCCCGGCCTTCG 
CGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGCA 
TTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGAC 
CGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTCC 
TTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTTA 
GCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTGG 
TCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTTG 
GGCCAGGTGTGGTTGGGATCCATGGAGACTCACCC 
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GTAAAAGAAAGTGGCCGGACCTTCATTGGGGTTTCGGTTCCCCCCTTTCCCNTTCCCCGGGG 
TCTGGGGGTGACATTGCACCGCGCCCNTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCCC 
CAGNTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACCTGA 
CCAGCCATGGGGGCTGCGGTGTTTTTCGGGCTGCACTTTCGTCGCGTTCGGGCCCGGCCTTC 
GCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGC 
ATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGA 
CCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTC 
CTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTT 
AGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTG 
GTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTT 
GGGCCAGGTGTGGTTGGGATCCATGGAGAC 
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FIGURE 225 

GCCCCAGGGAGCAGTGGGTGGTTATAACTCAGGCCCGGTGCCCAGAGCCCAGGAGGAGGCAG 
TGGCCAGGAAGGCACAGGCCTGAGAAGTCTGCGGCTGAGCTGGGAGCAAATCCCCCACCCCC 
TACCTGGGGGACAGGGCAAGTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGG 
TGTCTGTGCGTCCTGCACCCACATCTTTCTCTGTCCCCTCCTTGCCCTGTCTGGAGGCTGCT 
AGACTCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCGATGGTGGCCCGT 
CCTTGTGGTTCCTCTCTACCTGGGGAAATAAGGTGCAGCGGCCATGGCTACAGCAAGACCCC 
CCTGGATGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAGCATGTT 
CTCGCCAACAATGATGTTTCCTGTGACCACCCCTCTAACACCGTGCCCTCTGGGAGCAACCA 
GGACCTGGGAGCTGGGGCCGGGGAAGACGCCCGGTCGGATGACAGCAGCAGCCGCATCATCA 
ATGGATCCGACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCTGTTGCTAAGGCCCAAC 
CAGCTCTACTGCGGGGCGGTGTTGGTGCATCCACAGTGGCTGCTCACGGCCGCCCACTGCAG 
GAAGAAAGTTTTCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGC 
AGCAGATGTTCCAGGGGGTCAAATCCATCCCCCACCCTGGCTACTCCCACCCTGGCCACTCT 
AACGACCTCATGCTCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCAT 
CAACGTCTCCTCTCATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAA 
CCAAGAGCCCCCAAGTGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAGT 
CAGAAAAGGTGCGAGGATGCTTACCCGAGACAGATAGATGACACCATGTTCTGCGCCGGTGA 
CAAAGCAGGTAGAGACTCCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCC 
TGCAGGGACTCGTGTCCTGGGGAGATTACCCTTGTGCCCGGCCCAACAGACCGGGTGTCTAC 
ACGAACCTCTGCAAGTTCACCAAGTGGATCCAGGAAACCATCCAGGCCAACTCCTGAGTCAT 
CCCAGGACTCAGCACACCGGCATCCCCACCTGCTGCAGGGACAGCCCTGACACTCCTTTCAG 
ACCCTCATTCCTTCCCAGAGATGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCT 
CCTGGACTCAGGGTCTGCTTCCCCCACATTGGGCTGACCGTGTCTCTCTAGTTGAACCCTGG 
GAACAATTTCCAAAACTGTCCAGGGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCAT 
CCTCAAGCTCAGGGCCCATCCCTTCTCTGCAGCTCTGACCCAAATTTAGTCCCAGAAATAAA 
CTGAGAAGTGGAAAAAAAAA 
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MAT AR P PWMWVLCAL I TALLLGVTEHVLANND VS CDH PSNTVPS G S NQDLG AG AGE DARS DD 
SSSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLS 
PVYESGQQMFQGVKSIPHPGYSHPGHSNDLMLIK^ 

VSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGP 
WCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 



WO 99/63088 / PCT/US99/122S2 

FIGURE 227 



ATGGTCAACGACCGGTGGAAGACCATGGGCGGCGCTGCCCAACTTGAGGACCGGCCGCGCGA 

CAAGCCGCAGCGGCCGAGCTGCGGCTACGTGCTGTGCACCGTGCTGCTGGCCCTGGCTGTGC 

TGCTGGCTGTAGCTGTCACCGGTGCCGTGCTCTTCCTGAACCACGCCCACGCGCCGGGCACG 

GCGCCCCCACCTGTCGTCAGCACTGGGGCTGCCAGCGCCAACAGCGCCCTGGTCACTGTGGA 

AAGGGCGGACAGCTCGCACCTCAGCATCCTCATTGACCCGCGCTGCCCCGACCTCACCGACA 

GCTTCGCACGCCTGGAGAGCGCCCAGGCCTCGGTGCTGCAGGCGCTGACAGAGCACCAGGCC 

CAGCCACGGCTGGTGGGCGACCAGGAGCAGGAGCTGCTGGACACGCTGGCCGACCAGCTGCC 

CCGGCTGCTGGCCCGAGCCTCAGAGCTGCAGACGGAGTGCATGGGGCTGCGGAAGGGGCATG 

GCACGCTGGGCCAGGGCCTCAGCGCCCTGCAGAGTGAGCAGGGCCGCCTCATCCAGCTTCTC 

TCTGAGAGCCAGGGCCACATGGCTCACCTGGTGAACTCCGTCAGCGACATCCTGGATGCCCT 

GCAGAGGGACCGGGGGCTGGGCCGGCCCCGCAACAAGGCCGACCTTCAGAGAGCGCCTGCCC 

GGGGAACCCGGCCCCGGGGCTGTGCCACTGGCTCCCGGCCCCGAGACTGTCTGGACGTCCTC 

CTAAGCGGACAGCAGGACGATGGCGTCTACTCTGTCTTTCCCACCCACTACCCGGCCGGCTT 

CCAGGTGTACTGTGACATGCGCACGGACGGCGGCGGCTGGACGGTGTTTCAGCGCCGGGAGG 

ACGGCTCCGTGAACTTCTTCCGGGGCTGGGACGCGTACCGAGACGGCTTTGGCAGGCTCACC 

GGGGAGCACTGGCTAGGGCTCAAGAGGATCCACGCCCTGACCACACAGGCTGCCTACGAGCT 

GCACGTGGACCTGGAGGACTTTGAGAATGGCACGGCCTATGCCCGCTACGGGAGCTTCGGCG 

TGGGCTTGTTCTCCGTGGACCCTGAGGAAGACGGGTACCCGCTCACCGTGGCTGACTATTCC 

GGCACTGCAGGCGACTCCCTCCTGAAGCACAGCGGCATGAGGTTCACCACCAAGGACCGTGA 

CAGCGACCATTCAGAGAACAACTGTGCCGCCTTCTACCGCGGTGCCTGGTGGTACCGCAACT 

GCCACACGTCCAACCTCAATGGGCAGTACCTGCGCGGTGCGCACGCCTCCTATGCCGACGGC 

GTGGAGTGGTCCTCCTGGACCGGCTGGCAGTACTCACTCAAGTTCTCTGAGATGAAGATCCG 

GCCGGTCCGGGAGGACCGCTAGACTGGTGCACCTTGTCCTTGGCCCTGCTGGTCCCTGTCGC 

CCCATCCCCGACCCCACCTCACTCTTTCGTGAATGTTCTCCACCCACCTGTGCCTGGCGGAC 

CCACTCTCCAGTAGGGAGGGGCCGGGCCATCCCTGACACGAAGCTCCCTGGGCCGGTGAAGT 

CACACATCGCCTTCTCGCCGTCCCCACCCCCTCCATTTGGCAGCTCACTGATCTCTTGCCTC 

TGCTGATGGGGGCTGGCAAACTTGACGACCCCAACTCCTGCCTGCCCCCACTGTGACTCCGG 

TGCTGTTTGCCGTCCCCTGGCCAGGATGGTGGAGTCTGCCCCAGGCACCCTCTGCCCTGCCC 

GGCCAAATACCCGGCATTATGGGGACAGAGAGCAGGGGGCAGACAGCACCCCTGGAGTCCTC 

CTAGCAGATCGTGGGGAATGTCAGGTCTCTCTGAGGTCAGGTCTGAGGCCAGTATCCTCCAG 

CCCTCCCAATGCCAACCCCCACCCCGTTTCCCTGGTGCCCAGAGAACCCACCTCTCCCCCAA 

GGGCCTCAGCCTGGCTGTGGGCTGGGTGGCCCCATCCTACCAGGCCCTGAGGTCAGGATGGG 

GAGCTGCTGCCTTTGGGGACCCACGCTCCAAGGCTGAGACCAGTTCCCTGGAGGCCACCCAC 

CCTGTGCCCCGGCAGGCCTGGGGTCTGCAGTCCTCTTACCTGCTGTGCCCACCTGCTCTCTG 

TCTCAAATGAGGCCCAACCCATCCCCCACCCAGCTCCCGGCCGTCCTCCTACCTGGGGCAGC 

CGGGGCTGCCATCCCATTTCTCCTGCCTCTGGAAGGTGGGTGGGGCCCTGCACCGTGGGGCT 

GGACTGCGCTAATGGGAAGCTCTTGGTTTTCTGGGCTGGGGCCTAGGCAGGGCTGGGATGAG 

GCTTGTACAACCCCCACCACCAATTTCCCAGGGACTCCAGGGTCCTGAGGCCTCCCAGGAGG 

GCCTTGGGGGTGATGACCCCTTCCCTGAGGTGGCTGTCTCCATGAGGAGGCCAACCCTTGCC 

ATTGACCGTGGCCACCTGGACCCAGGCCAGGCCCGGCCCGGCGAGTGGTCAAGGGACAGGGA 

CCACCTCACCGGGCAAATGGGGTCGGGGGGACTGGGGCACCAGACCAGGCACCACCTGGACA 

CTTTCTTGTTGAATCCTCCCAACACCCAGCACGCTGTCATCCCCACTCCTTGTGTGCACACA 

TGCAGAGGTGAGACCCGCAGGCTCCCAGGACCAGCAGCCACAAGGGCAGGGCTGGAGCCGGG 

TCCTCAGCTGTCTGCTCAGCAGCCCTGGACCCGCGTGCGTTACGTCAGGCCCAGATGCAGGG 

CGGCTTTTCCAAGGCCTCCTGATGGGGGCCTCCGAAAGGGCTGGAGTCAGCCTTGGGGAGCT 

GCCTAGCAGCCTCTCCTCGGGCAGGAGGGGAGGTGGCTTCCTCCAAAGGACACCCGATGGCA 

GGTGCCTAGGGGGTGTGGGGTTCCGTTCTCCCTTCCCCTCCCACTGAAGTTTGTGCTTAAAA 

AACAATAAATTTGACTTGGCACCACTGGGGGTTGGTGGGAGAGGCCGTGTGACCTGGCTCTC 

TGTCCCAGTGCCACCAGGTCATCCACATGCGCAG 
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MVNDRWKTMGGAAQLEDRPR 

APPPWSTGAASANSALVTVERADSSHLS I LI DPRC PDLTDS FARLESAQAS VLQALTEHQA 
QPRLVGDQEQELLDTLADQLPRLLARASELQTECMGLRKGHGTLGQGLSALQSEQGRLIQLL 
SESQGHMAHLVNSVSDILDALQRDRGLGRPRNKADLQRAPARGTRPRGCATGSRPRDCLDVL 
LSGQQDDGVYSVFPTHYPAGFQVYCDMRTDGGGWTVFQRREDGSVNFFRGWDAYRDGFGRLT 
GEHWLGLKRIHALTTQAAYELHVDLEDFENGTAYARYGSFGVGLFSVDPEEDGYPLTVADYS 
GTAGDSLLKHSGMRFTTKDRDSDHSE1TOCAAFYRGAWWYRNCHTSNLNGQYLRGAHASYADG 
VEWSSWTGWQYSLKFSEMKIRPVREDR 
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FIGURE 229 

GCAGTCAGAGACTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGCCT 
TGCTTCCTGAACTAGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCACTCT 
(^CCGCTGTAGGAATCCAGATCCAGGCCAAGTACAGCAGCACGAGGGACATGCTGGATGATG 
ATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTGCCACAACTCGGCATCCAGAGCCC 
CGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGACCCTGCTGAC 
TTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTACTACC 
AGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATACGTCC 
CAAGAGTTGCAATCTCTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTCTGCAGCATGTGGC 
TGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGCACACAGGTGCAGCCCTTGTACAG 
AACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATAAAGACAGCAAAAGTTGGGAG 
GACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAAACAAGAAGA 
CCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTTTCTACTCTTATTGGACAGGGCTTT 
TGCGCCCTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCTGAACTG 
TTCCATATTATAATAGATGTCACCAGCCCAAGAAGCAGAGACTGTGTGGCCATCCTCAATGG 
GATGATCTTCTCAAAGGACTGCAAAGAATTGAAGCGTTGTGTCTGTGAGAGAAGGGCAGGAA 
TGGTGAAGCCAGAGAGCCTCCATGTCCCCCCTGAAACATTAGGCGAAGGTGACTGATTCGCC 
CTCTGCAACTACAAATAGCAGAGTGAGCCAGGCGGTGCCAAAGCAAGGGCTAGTTGAGACAT 
TGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTG 
TTTCCTGTTCAGGATCACCAGCATTTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACA 
AGAAGTCTTATTTACATGCCACCAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTG 
GCTTAGAGATAACTTTTAGCTCTCTTTCTTCTCAATGTCTAATATCACCTCCCTGTTTTCAT 
GTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAGAGGAAATACATTGAGGTAAC 
ATCCTTTTCTCTGACAGTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAGATTGTACC 
AGCAAATACACAAGGAATTCTTTTTGTTTGTTTCAGTTCATACTAGTCCCTTCCCAATCCAT 
CAGTAAAGACCCCATCTGCCTTGTCCATGCCGTTTCCCAACAGGGATGTCACTTGATATGAG 
AATCTCAAATCTCAATGCCTTATAAGCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTG 
TCTCCCCTCCATAGGAATTTCTCCCAGGAAAGAAATATATCCCCATCTCCGTTTCATATCAG 
AACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGTGAGCCTCTTCA 
TCTGCACCTGTAATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAG 
GTACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 
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FTGURE 230 

MQAKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLL 
I GLAALGLLFFQYYQLSNTGQDT I SQMEERLGNTSQELQS LQVQN I KLAG SLQHVAEKLCRE 
LYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAAS 
QSYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHI I I DVTSPRSRDCVAILNGMI FSKD 
CKELKRCV CERRAGMVKPESLHVPPETLGEGD 
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FIGURE 231 

AATTTTCACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGNTGG 
ATGATGATGGGACACCACCATGAGCCTGCATTNTCAAGCTTTTGCCACAATTCGGCATCCAG 
AGCCCCGGCGCACAGAGCACAGGGNTCCTTTTTCAACGTGGCGACCAGTGGCCCTGACCCTG 
CTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTA 
CTACCAGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATA 
CGTCCCAAGAGTTGCAATTTNTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTNTGCAGCAT 
GTGGCTGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGGAACTTTGAAGGAGGGCAA 
AGTNTCCTCATNTACTATACACACACCACTTCCC 
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FIGURE 232 

CCCGaGCCCTCCGt»TCCGCCCCCT^ 
^ GC J GCGGCCCGCAGGGCCCGTGGCCGTGGG ^ 

AGCTGCCGCCGCGCGGCAACACCAACGCGGCGCGCCGGCCCAACTCGGTG^ 
^? CC J CCCTGCACAGCCC ^^ 

S GA SS TGGGCATCAGGCAGAGGCTGC ^TGGCGGTGCTGACC 
ACGG JS GGCGTGGCCGTGAACCGCACGC TGGGGCACCGGCTGGAGC^^ 
^GCGCACWGGCCGCCGGGCCC^CCT^CATGGCAGTGGTGACGCT^ 
CCATTGGACACCTGC^CCTGGCGCTGCGCCACCTGCTGGAGCAGCAC^ 



ICTG 



GCACGCTTTCTCCTGCGCCGATGGCTCACCCCGCTGCCCACTGCGTGGGGC^^ 
CGGCTCCAGAAGCAGCAGCTGGTGAATGGCTACCGACGCTTTGATCCGG^ 

™ ACGCTGGACTTGCAGCT ^^ 

? C ™S GCAGCTGCTCCGGCCGCTGAGCCGCGTGG AGATOT^ 

G 5 G SS ctcaggtctcactgtgctgctgg ctctagctgcggctgagc^ 

^r GGAGGCCTTTGCCACTGCAGCAG TGGAGCCTGGTGAT^TG^^^ 

tgctgctactgtatgagccgcgc^ggcccagcgcgtggcccatgcaga?^ 

GTCAAGGCCCACGTGGCAGAGCTGGAGCGGCGTTTCCCCGGTGCC?^ 

IS GCAGACAGC ^ 

TCGACACACTGTTCCTGCTGGCCGGGCCAGACACGGTGCTCACG^ 

^ GCGC f GCATGCCATCTCGGGG TGGCAGGCCTTCTTTCCCATG^^^^ 

^ A Sf GTGGCCCCACCACAAGGGCCTGGGCCCGG AGAGCTGG^ 

TTGATCGCCAGGCAGCCAGCGAGGCCTGCTTCTACAACTCCGACTACG^ 

CGCCTGGCGGCAGCCTCAGAACAAGAAGAGGAGCTGCTG^^ 

gttcctccacttctccagtctgcatgtgctgcgggcggtggagScggc^ 

GTGCTTGAGGGCCTCGGCTCCCGAACCCAGCTGGCCATGCra^^ 
CAACAGCACC^CCCCACCCTGTCCCCGTGGGCCGTGGCATGGCcIcIrc^ 

AAGCTGGCCC^CTGGTCCCCTCTCTGGCTCTGTGGGTCCCTGGGCTCT^ 

G S GA T CGGGCTGTGGOT ^ 

^^ CCCTGGGGGCTGGGCTGTAGAAGAGT TGTTGG^^ 
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FIGURE 233 

MRASLLLSVLRPAGPVAVGISLGFTLSLLSVTWVEEPCGPGPPQPGDSELPPRGNTNAARRP 
NSVQPGAEREKPGAGEGAGENWEPRVLPYHPAQPGQAAKKAVRTRY I STELG IRQRLLVAVL 

TSQTTLPTLGVAVl^TLGHRLERWFLTGARGRRAPPGMAVVTLGEERPIGHI^L^RHLLE 

QHGDDFDWPFLVPDTTYTEAHGLARLTGHLSLASAAHLYLGRPQDFIGGEPTPGRYCHGGFG 

VLLSRMLLQQLRPHLEGCRNDIVSARPDEWLGRCILDATGVGCTGDHEGVHYSHLELSPGEP 

VQEGDPHFRSALTAHPVRDPVHMYQLHKAFARAELERTYQEIQELQWEIQNTSHLAVDGDRA 

AAWPVGIPAPSRPASRFEVLRWDYFTEQHAFSCADGSPRCPLRGADRADVADVLGTALEELN 

RRYHPALRLQKQQLVNGYRRFDPARGMEYTLDLQLEALTPQGGRRPLTRRVQLLRPLSRVEI 

LPVPYVTEASRLTVLLPLAAAERDLAPGFLEAFATAALEPGDAAAALTLLLLYEPRQAQRVA 

HADVFAPVKAHVAELERRFPGARVPWLSVQTAAPSPLRLMDLLSKKHPLDTLFLLAGPDTVL 

TPDFLNRCRMHAISGWQAFFPMHFQAFHPGVAPPQGPGPPELGRDTGRFDRQAASEACFYNS 

DYVAARGRLAAASEQEEELLESLDVYELFLHFSSLHVLRAVEPALLQRYRAQTCSARLSEDL 
YHRCLQSVLEGLGSRTQLAMLLFEQEQGNST 
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GCTCTGGCCGGCCCCGGCGATTGGTCACCGCCCGCTAGGGGACAGCCCTGGCCTCCTCTGAT 

TGGCAAGCGCTGGCCACCTCCCCACACCCCTTGCGAACGCTCCCCTAGTGGAGAAAAGGAGT 

AGCTATTAGCCAATTCGGCAGGGCCCGCTTTTTAGAAGCTTGATTTCCTTTGAAGATGAAAG 

ACTAGCGGAAGCTCTGCCTCTTTCCCCAGTGGGCGAGGGAACTCGGGGCGATTGGCTGGGAA 

CTGTATCCACCCAAATGTCACCGATTTCTTCCTATGCAGGAAATGAGCAGACCCATCAATAA 

GAAATTTCTCAGCCTGGCCGAAAATGGTTGGCCCCACGAAGCCACGACAACTGGAGGCAAAG 

AGGGTTGCTCAACGCCCCGCCTCATTGGAAAACCAAATCAGATCTGGGACCTATATAGCGTG 

GCGGAGGCGGGGCGATGATTGTCGCGCTCGCACCCACTGCAGCTGCGCACAGTCGCATTTCT 

TTCCCCGCCCCTGAGACCCTGCAGCACCATCTGTCATGGCGGCTGGGCTGTTTGGTTTGAGC 

GCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGCTCCCGGCCGCCCGCGTCCGCTGGGA 

ATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGTGGCGGGAAAGCGGCCCCCAGAAC 

CGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGACGAAAACTTGTATGAGAAGAACCCA 

GACTCCCATGGTTATGACAAGGACCCCGTTTTGGACGTCTGGAACATGCGACTTGTCTTCTT 

CTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACCTTTGTGGCCTATCTGCCTGACTACA 

GGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCTTGTGAAATACCGAGAGGCCAATGGC 

CTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCAAGATCCAGCTGCCAGAGGATGAGTG 

ACCAGTTGCTAAGTGGGGCTCAAGAAGCACCGCCTTCCCCACCCCCTGCCTGCCATTCTGAC 

CTCTTCTCAGAGCACCTAATTAAAGGGGCTGAAAGTCTGAA 
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FIGURE 235 

MAAGLFGLSARRLLAAAATRGLPAARVRWESSFSRTVVAPSAVAGKRPPEPTTPWQEDPEPE 
DENLYEKNPDSHGYDKDP VLDVWNMRLVFFFGVS 1 1 LVLGSTFVAYLPDYRMKEWSRREAER 
LVKYREANGLPIMESNCFDPSKIQLPEDE 
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FIGURE 236 
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GGCGGCTGGGCTGTTTGGTTTGAGCGCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGC 
TCCCGGCCGCCCGCGTCCGCTGGGAATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCT 
GTGGCGGGAAAGCGGCCCCCAGAACCGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGA 
CGAAAACTTGTATGAGAAGAACCCAGACTCCCATGGTTATGACAAGGACCCCGTTTTGGACG 
TCTGGAACATGCGACTTGTCTTCTTCTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACC 
TTTGTGGCCTATCTGCCTGACTACAGGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCT 
TGTGAAATACCGAGAGGCCAATGGCCTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCA 
AGATCCAG 
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FIGURE 237 

GCGGCGGCTATGCCGCTTGCTCTGCTCGTCCTGTTGCTCCTGGGGCCCGGCGGCTGGTGCCT 

TGCAGAACCCCCACGCGACAGCCTGCGGGAGGAACTTGTCATCACCCCGCTGCCTTCCGGGG 

ACGTAGCCGCCACATTCCAGTTCCGCACGCGCTGGGATTCGGAGCTTCAGCGGGAAGGAGTG 

TCCCATTACAGGCTCTTTCCCAAAGCCCTGGGGCAGCTGATCTCCAAGTATTCTCTACGGGA 

GCTGCACCTGTCATTCACACAAGGCTTTTGGAGGACCCGATACTGGGGGCCACCCTTCCTGC 

AGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTTCCAAGACACTGTCACTGATGTGGATAAA 

TCTTGGAAGGAGCTCAGTAATGTCCTCTCAGGGATCTTCTGCGCCTCTCTCAACTTCATCGA 

CTCCACCAACACAGTCACTCCCACTGCCTCCTTCAAACCCCTGGGTCTGGCCAATGACACTG 

ACCACTACTTTCTGCGCTATGCTGTGCTGCCGCGGGAGGTGGTCTGCACCGAAAACCTCACC 

CCCTGGAAGAAGCTCTTGCCCTGTAGTTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGA 

TCGCTTGTTCCACACCAGCTACCACTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATG 

CACGCTGTACTAGCATCTCCTGGGAGCTGAGGCAGACCCTGTCAGTTGTATTTGATGCCTTC 

ATCACGGGGCAGGGAAAGAAAGACTGGTCCCTCTTCCGGATGTTCTCCCGAACCCTCACGGA 

GCCCTGCCCCCTGGCTTCAGAGAGCCGAGTCTATGTGGACATCACCACCTACAACCAGGACA 

ACGAGACATTAGAGGTGCACCCACCCCCGACCACTACATATCAGGACGTCATCCTAGGCACT 

CGGAAGACCTATGCCATCTATGACTTGCTTGACACCGCCATGATCAACAACTCTCGAAACCT 

CAACATCCAGCTCAAGTGGAAGAGACCCCCAGAGAATGAGGCCCCCCCAGTGCCCTTCCTGC 

ATGCCCAGCGGTACGTGAGTGGCTATGGGCTGCAGAAGGGGGAGCTGAGCACACTGCTGTAC 

AACACCCACCCATACCGGGCCTTCCCGGTGCTGCTGCTGGACACCGTACCCTGGTATCTGCG 

GCTGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAACAAACCAAGTTACATCC 

ACTACCAGCCTGCCCAGGACCGGCTGCAACCCCACCTCCTGGAGATGCTGATTCAGCTGCCG 

GCCAACTCAGTCACCAAGGTTTCCATCCAGTTTGAGCGGGCGCTGCTGAAGTGGACCGAGTA 

CACGCCAGATCCTAACCATGGCTTCTATGTCAGCCCATCTGTCCTCAGCGCCCTTGTGCCCA 

GCATGGTAGCAGCCAAGCCAGTGGACTGGGAAGAGAGTCCCCTCTTCAACAGCCTGTTCCCA 

GTCTCTGATGGCTCTAACTACTTTGTGCGGCTCTACACGGAGCCGCTGCTGGTGAACCTGCC 

GACACCGGACTTCAGCATGCCCTACAACGTGATCTGCCTCACGTGCACTGTGGTGGCCGTGT 

GCTACGGCTCCTTCTACAATCTCCTCACCCGAACCTTCCACATCGAGGAGCCCCGCACAGGT 

GGCCTGGCCAAGCGGCTGGCCAACCTTATCCGGCGCGCCCGAGGTGTCCCCCCACTCTGATT 

CTTGCCCTTTCCAGCAGCTGCAGCTGCCGTTTCTCTCTGGGGAGGGGAGCCCAAGGGCTGTT 

TCTGCCACTTGCTCTCCTCAGAGTTGGCTTTTGAACCAAAGTGCCCTGGACCAGGTCAGGGC 

CTACAGCTGTGTTGTCCAGTACAGGAGCCACGAGCCAAATGTGGCATTTGAATTTGAATTAA 

CTTAGAAATTCATTTCCTCACCTGTAGTGGCCACCTCTATATTGAGGTGCTCAATAAGCAAA 

AGTGGTCGGTGGCTGCTGTATTGGACAGCACAGAAAAAGATTTCCATCACCACAGAAAGGTC 

GGCTGGCAGCACTGGCCAAGGTGATGGGGTGTGCTACACAGTGTATGTCACTGTGTAGTGGA 

TGGAGTTTACTGTTTGTGGAATAAAAACGGCTGTTTCCGTGGAAAAAAAAAAAA 
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FIGURE 23K 

MPLALLVLLLLGPGGWCLAEPPRDSLREELVITPLPSGDVAATFQFRTRWDSELQREGVSHY 

PJLFPKALGQLISKYSLRELHLSFTQGFTOTRYWGPPFLQAPSGAELWVWPQDTVTDVDKSWK 

ELSNVLSGIFCASLNFIDSTNTVTPTASFKPLGLANDTDHYFLRYAVLPREWCTENLTPWK 
KLLPCSSKAGLS VLLKADRL FHTS YHSQAVH I RPVCRNARCTS I S WELRQTLS WFDAF I TG 

QGKKDWSLFRMFSRTLTEPCPLASESRVYVDITTYNQDNETLEVHPPPTTTYQDVILGTRKT 
YAIYDLLDTAMINNSRNLNIQLKWKRPPENEAPPVPFLHAQRYVSGYGLQKGELSTLLYNTH 
PYRAFPVLLLDTVPWYLRLYVHTLTITSKGKENKPSYIHYQPAQDRLQPHLLEMLIQLPANS 
VTKVSIQFERALLKWTEYTPDPNHGFYVSPSVLSALVPSMVAAKPVDWEESPLFNSLFPVSD 
GSNYFVRLYTEPLLVNLPTPDFSMPYNVI CLTCTWAVCYGS FYNLLTRTFH IEEPRTGGLA 
KRLANL I RRARG VP PL 
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CAACATGGGGTCCAGCAGCTTCTTGGTCCTCATGGTGTCTCTCGTTCTTGTGACCCTGGTGG 

CTGTGGAAGGAGTTAAAGAGGGTATAGAGAAAGCAGGGGTTTGCCCAGCTGACAACGTACGC 

TGCTTCAAGTCCGATCCTCCCCAGTGTCACACAGACCAGGACTGTCTGGGGGAAAGGAAGTG 

TTGTTACCTGCACTGTGGCTTCAAGTGTGTGATTCCTGTGAAGGAACTGGAAGAAGGAGGAA 

ACAAGGATGAAGATGTGTCAAGGCCATACCCTGAGCCAGGATGGGAGGCCAAGTGTCCAGGC 

TCCTCCTCTACCAGGTGTCCTCAGAAATGATGCTGGGTCCTTTCTACCTCTGGGGGTCACTC 

TCACTTGGCACCTGCCCCTGAGGGTCCTGAGACTTGGAATATGGAAGAAGCAATACCCAACC 

CCACCAAAGAAAACCTGAGCTTGAAGTCCTTTTCCCCAAAAAGAGGGAAGAGTCACAAAAAG 

TCCAGACCCCAGGGACGGTACTTTCCCTCTCTACCTGGTGCTCCTCCCTAATGCTCATGAAT 

GGACCCCTCATGAATGAAACCAGTGCCCTTATAAGAGACCCCAAAGAGCTGCCTTGCCCTTC 

TGCAATGTGTGATCACAGCTAGAAGGCACTGTCAGAGAAGAGAAACTGGTCCTCACCAGATG 

CTGAATCTGCTGGTGCCTTGATCTTGGACTTCCCAGCCTCTAGAACTGTAAGAAATAAATAT 
TTGCTGTTTATAATCCAA 
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FIGURE 240 

MGSSSFLVLMVSLVLVTLVAVEGVKEGIEKAGVCPADNVRCFKSDPPQCHTDQDCLGERKCC 
YLHCGFKCVIPVKELEEGGNKDEDVSRPYPEPGWEAKCPGSSSTRCPQK 
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FIGURE 241 

AAACTCAGCACTTGCCGGAGTGGCTCATTGTTAAGACAAAGGGTGTGCACTTCCTGGCCAGG 

AAACCTGAGCGGTGAGACTCCCAGCTGCCTACATCAAGGCCCCAGGACATGCAGAACCTTCC 

TCTAGAACCCGACCCACCACCATGAGGTCCTGCCTGTGGAGATGCAGGCACCTGAGCCAAGG 

CGTCCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTCTTTCTCTTCGCCTTGCCCTCTTTTA 

TTAAGGAGCCTCAAACAAAGCCTTCCAGGCATCAACGCACAGAGAACATTAAAGAAAGGTCT 

CTACAGTCCCTGGCAAAGCCTAAGTCCCAGGCACCCACAAGGGCGAGGAGGACAACCATCTA 

TGCAGAGCCAGCGCCAGAGAACAATGGCCTCAACACACAAACCCAGCCCAAGGCCCACACCA 

CCGGAGACAGAGGAAAGGAGGCCAACCAGGCACCGCCGGAGGAGCAGGACAAGGTGCCCCAC 

ACAGCACAGAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCATGGTGAACACACTGTC 

ACCCAGAGGGCAAGATGCAGGGATGGCCTCTGGCAGGACAGAGGCACAATCATGGAAGAGCC 

AGGACACAAAGACGACCCAAGGAAATGGGGGCCAGACCAGGAAGCTGACGGCCTCCAGGACG 

GTGTCAGAGAAGCACCAGGGCAAAGCGGCAACCACAGCCAAGACGCTCATTCCCAAAAGTCA 

GCACAGAATGCTGGCTCCCACAGGAGCAGTGTCAACAAGGACGAGACAGAAAGGAGTGACCA 

CAGCAGTCATCCCACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCCCTGCCCCTTTCCAG 

AGCCCCACGACGCAGAGAAACCAAAGACTGAAGGCCGCCAACTTCAAATCTGAGCCTCGGTG 

GGATTTTGAGGAAAAATACAGCTTCGAAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTG 

TGAAGATCAAAGCCTCCAAGTCGCTGTGGCTCCAGAAACTCTTTCTGCCCAACCTCACTCTC 

TTCCTGGACTCCAGACACTTCAACCAGAGTGAGTGGGACCGCCTGGAACACTTTGCACCACC 

CTTTGGCTTCATGGAGCTCAACTACTCCTTGGTGCAGAAGGTCGTGACACGCTTCCCTCCAG 

TGCCCCAGCAGCAGCTGCTCCTGGCCAGCCTCCCCGCTGGGAGCCTCCGGTGCATCACCTGT 

GCCGTGGTGGGCAACGGGGGCATCCTGAACAACTCCCACATGGGCCAGGAGATAGACAGTCA 

CGACTACGTGTTCCGATTGAGCGGAGCTCTCATTAAAGGCTACGAACAGGATGTGGGGACTC 

GGACATCCTTCTACGGCTTTACCGCCTTCTCCCTGACCCAGTCACTCCTTATATTGGGCAAT 

CGGGGTTTCAAGAACGTGCCTCTTGGGAAGGACGTCCGCTACTTGCACTTCCTGGAAGGCAC 

CCGGGACTATGAGTGGCTGGAAGCACTGCTTATGAATCAGACGGTGATGTCAAAAAACCTTT 

TCTGGTTCAGGCACAGACCCCAGGAAGCTTTTCGGGAAGCCCTGCACATGGACAGGTACCTG 

TTGCTGCACCCAGACTTTCTCCGATACATGAAGAACAGGTTTCTGAGGTCTAAGACCCTGGA 

TGGTGCCCACTGGAGGATATACCGCCCCACCACTGGGGCCCTCCTGCTGCTCACTGCCCTTC 

AGCTCTGTGACCAGGTGAGTGCTTATGGCTTCATCACTGAGGGCCATGAGCGCTTTTCTGAT 

CACTACTATGATACATCATGGAAGCGGCTGATCTTTTACATAAACCATGACTTCAAGCTGGA 

GAGAGAAGTCTGGAAGCGGCTACACGATGAAGGGATAATCCGGCTGTACCAGCGTCCTGGTC 

CCGGAACTGCCAAAGCCAAGAACTGACCGGGGCCAGGGCTGCCATGGTCTCCTTGCCTGCTC 

CAAGGCACAGGATACAGTGGGAATCTTGAGACTCTTTGGCCATTTCCCATGGCTCAGACTAA 

GCTCCAAGCCCTTCAGGAGTTCCAAGGGAACACTTGAACCATGGACAAGACTCTCTCAAGAT 

GGCAAATGGCTAATTGAGGTTCTGAAGTTCTTCAGTACATTGCTGTAGGTCCTGAGGCCAGG 

GATTTTTAATTAAATGGGGTGATGGGTGGCCAATACCACAATTCCTGCTGAAAAACACTCTT 

CCAGTCCAAAAGCTTCTTGATACAGAAAAAAGAGCCTGGATTTACAGAAACATATAGATCTG 

GTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCTTGAAGGTATTACTTAACTTCACTAC 

AGATTGTCTAGAAGACCTTTCTAGGAGTTATCTGATTCTAGAAGGGTCTATACTTGTCCTTG 

TCTTTAAGCTATTTGACAACTCTACGTGTTGTAGAAAACTGATAATAATACAAATGATTGTT 

GTCCATGGAAAGGCAAATAAATTTTCTACAGTGAAAAAAAAAAAAAAA 
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FIGURE 242 

MRSCLWRCRHLSQGVQWSLLLAVLVFFLFALPSFIKEPQTKPSRHQRTENIKERSLQSLAKP 
KSQAPTRARRTTIYAEPAPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAW 
KSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEKHQG 
KAATTAKTL I PKSQHRMLAPTGAVSTRTRQKGVTTAVI PPKEKKPQATPPPAPFQSPTTQRN 

QRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDSRHF 
NQSEWDRLEHFAPPFGFMELNYSLVQK\ArrRFPPVPQQQLLLASLPAGSLRCITCAWGNGG 
ILNNSHMGQEIDSHDYVFRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVP 
LGKDVRYLHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHMDRYLLLHPDFL 
RYMKNRFLRSKTLDGAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSW 
KRL I FYINHDFKLERE VWKRLHDEG I IRLYQRPGPGTAKAKN 
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CGATGCGCGGACCCGGGCACCCCCTCCTCCTGGGGCTGCTGCTGGTGCTGGGGCCTTCGCCG 
GAGCAGCGAGTGGAAATTGTTCCTCGAGATCTGAGGATGAAGGACAAGTTTCTAAAACACCT 
TACAGGCCCTCTTTATTTTAGTCCAAAGTGCAGCAAACACTTCCATAGACTTTATCACAACA 
CCAGAGACTGCACCATTCCTGCATACTATAAAAGATGCGCCAGGCTTCTTACCCGGCTGGCT 
GTCAGTCCAGTGTGCATGGAGGATAAGTGAGCAGACCGTACAGGAGCAGCACACCAGGAGCC 
ATGAGAAGTGCCTTGGAAACCAACAGGGAAACAGAACTATCTTTATACACATCCCCTCATGG 
ACAAGAGATTTATTTTTGCAGACAGACTCTTCCATAAGTCCTTTGAGTTTTGTATGTTGTTG 
ACAGTTTGCAGATATATATTCGATAAATCAGTGTACTTGACAGTGTTATCTGTCACTTATTT 
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FIGURE 244 



MRGPGHPLLLGLLLVLGPSPEQRVEIVPRDLRMKDKFLKHLTGPLYFSPKCSKHFHRLYHNT 
RDCTI PAYYKRCARLLTRLAVSPVCMEDK 
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FIGURE 245 

GGGCTGGGCCCCGCCGCAGCTCCAGCTGGCCGGCTTGGTCCTGCGGTCCCTTCTCTGGGAGG 

CCCGACCCCGGCCGCGCCCAGCCCCCACCATGCCACCCGCGGGGCTCCGCCGGGCCGCGCCG 

CTCACCGCAATCGCTCTGTTGGTGCTGGGGGCTCCCCTGGTGCTGGCCGGCGAGGACTGCCT 

GTGGTACCTGGACCGGAATGGCTCCTGGCATCCGGGGTTTAACTGCGAGTTCTTCACCTTCT 

GCTGCGGGACCTGCTACCATCGGTACTGCTGCAGGGACCTGACCTTGCTTATCACCGAGAGG 

CAGCAGAAGCACTGCCTGGCCTTCAGCCCCAAGACCATAGCAGGCATCGCCTCAGCTGTGAT 

CCTCTTTGTTGCTGTGGTTGCCACCACCATCTGCTGCTTCCTCTGTTCCTGTTGCTACCTGT 

ACCGCCGGCGCCAGCAGCTCCAGAGCCCATTTGAAGGCCAGGAGATTCCAATGACAGGCATC 

CCAGTGCAGCCAGTATACCCATACCCCCAGGACCCCAAAGCTGGCCCTGCACCCCCACAGCC 

TGGCTTCATGTACCCACCTAGTGGTCCTGCTCCCCAATATCCACTCTACCCAGCTGGGCCCC 

CAGTCTACAACCCTGCAGCTCCTCCTCCCTATATGCCACCACAGCCCTCTTACCCGGGAGCC 

TGAGGAACCAGCCATGTCTCTGCTGCCCCTTCAGTGATGCCAACCTTGGGAGATGCCCTCAT 

CCTGTACCTGCATCTGGTCCTGGGGGTGGCAGGAGTCCTCCAGCCACCAGGCCCCAGACCAA 

GCCAAGCCCTGGGCCCTACTGGGGACAGAGCCCCAGGGAAGTGGAACAGGAGCTGAACTAGA 

ACTATGAGGGGTTGGGGGGAGGGCTTGGAATTATGGGCTATTTTTACTGGGGGCAAGGGAGG 

GAGATGACAGCCTGGGTCACAGTGCCTGTTTTCAAATAGTCCCTCTGCTCCCAAGATCCCAG 

CCAGGAAGGCTGGGGCCCTACTGTTTGTCCCCTCTGGGCTGGGGTGGGGGGAGGGAGGAGGT 

TCCGTCAGCAGCTGGCAGTAGCCCTCCTCTCTGGCTGCCCCACTGGCCACATCTCTGGCCTG 

CTAGATTAAAGCTGTAAAGACAAAA 
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FIGURE 246 

MPPAGLRRAAPLTAIALLVLGAPLVLAGEDCLWYLDRNGSWHPGFNCEFFTFCCGTCYHRYC 

CRDLTLLITERQQKHCLAFSPKTIAGIASAVILFVAWATTICCFLCSCCYLYRRRQQLQSP 

FEGQEIPMTGIPVQPVYPYPQDPKAGPAPPQPGFMYPPSGPAPQYPLYPAGPPVYNPAAPPP 
YMPPQPSYPGA 
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FIGURE 247 

GGGGGAGCTAGGCCGGCGGCAGTGGTGGTGGCGGCGGCGCAAGGGTGAGGGCGGCCCCAGAA 

CCCCAGGTAGGTAGAGCAAGAAGATGGTGTTTCTGCCCCTCAAATGGTCCCTTGCAACCATG 

TCATTTCTACTTTCCTCACTGTTGGCTCTCTTAACTGTGTCCACTCCTTCATGGTGTCAGAG 

CACTGAAGCATCTCCAAAACGTAGTGATGGGACACCATTTCCTTGGAATAAAATACGACTTC 

CTGAGTACGTCATCCCAGTTCATTATGATCTCTTGATCCATGCAAACCTTACCACGCTGACC 

TTCTGGGGAACCACGAAAGTAGAAATCACAGCCAGTCAGCCCACCAGCACCATCATCCTGCA 

TAGTCACCACCTGCAGATATCTAGGGCCACCCTCAGGAAGGGAGCTGGAGAGAGGCTATCGG 

AAGAACCCCTGCAGGTCCTGGAACACCCCCCTCAGGAGCAAATTGCACTGCTGGCTCCCGAG 

CCCCTCCTTGTCGGGCTCCCGTACACAGTTGTCATTCACTATGCTGGCAATCTTTCGGAGAC 

TTTCCACGGATTTTACAAAAGCACCTACAGAACCAAGGAAGGGGAACTGAGGATACTAGCAT 

CAACACAATTTGAACCCACTGCAGCTAGAATGGCCTTTCCCTGCTTTGATGAACCTGCCTTC 

AAAGCAAGTTTCTCAATCAAAATTAGAAGAGAGCCAAGGCACCTAGCCATCTCCAATATGCC 

ATTGGTGAAATCTGTGACTGTTGCTGAAGGACTCATAGAAGACCATTTTGATGTCACTGTGA 

AGATGAGCACCTATCTGGTGGCCTTCATCATTTCAGATTTTGAGTCTGTCAGCAAGATAACC 

AAGAGTGGAGTCAAGGTTTCTGTTTATGCTGTGCCAGACAAGATAAATCAAGCAGATTATGC 

ACTGGATGCTGCGGTGACTCTTCTAGAATTTTATGAGGATTATTTCAGCATACCGTATCCCC 

TACCCAAACAAGATCTTGCTGCTATTCCCGACTTTCAGTCTGGTGCTATGGAAAACTGGGGA 

CTGACAACATATAGAGAATCTGCTCTGTTGTTTGATGCAGAAAAGTCTTCTGCATCAAGTAA 

GCTTGGCATCACAGTGACTGTGGCCCATGAACTGGCCCACCAGTGGTTTGGGAACCTGGTCA 

CTATGGAATGGTGGAATGATCTTTGGCTAAATGAAGGATTTGCCAAATTTATGGAGTTTGTG 

TCTGTCAGTGTGACCCATCCTGAACTGAAAGTTGGAGATTATTTCTTTGGCAAATGTTTTGA 

CGCAATGGAGGTAGATGCTTTAAATTCCTCACACCCTGTGTCTACACCTGTGGAAAATCCTG 

CTCAGATCCGGGAGATGTTTGATGATGTTTCTTATGATAAGGGAGCTTGTATTCTGAATATG 

CTAAGGGAGTATCTTAGCGCTGACGCATTTAAAAGTGGTATTGTACAGTATCTCCAGAAGCA 

TAGCTATAAAAATACAAAAAACGAGGACCTGTGGGATAGTATGGCAAGTATTTGCCCTACAG 

ATGGTGTAAAAGGGATGGATGGCTTTTGCTCTAGAAGTCAACATTCATCTTCATCCTCACAT 

TGGCATCAGGAAGGGGTGGATGTGAAAACCATGATGAACACTTGGACACTGCAGAGGGGTTT 

TCCCCTAATAACCATCACAGTGAGGGGGAGGAATGTACACATGAAGCAAGAGCACTACATGA 

AGGGCTCTGACGGCGCCCCGGACACTGGGTACCTGTGGCATGTTCCATTGACATTCATCACC 

AGCAAATCCAACATGGTCCATCGATTTTTGCTAAAAACAAAAACAGATGTGCTCATCCTCCC 

AGAAGAGGTGGAATGGATCAAATTTAATGTGGGCATGAATGGCTATTACATTGTGCATTACG 

AGGATGATGGATGGGACTCTTTGACTGGCCTTTTAAAAGGAACACACACAGCAGTCAGCAGT 

AATGATCGGGCAAGTCTCATTAACAATGCATTTCAGCTCGTCAGCATTGGGAAGCTGTCCAT 

TGAAAAGGCCTTGGATTTATCCCTGTACTTGAAACATGAAACTGAAATTATGCCCGTGTTTC 

AAGGTTTGAATGAGCTGATTCCTATGTATAAGTTAATGGAGAAAAGAGATATGAATGAAGTG 

GAAACTCAATTCAAGGCCTTCCTCATCAGGCTGCTAAGGGACCTCATTGATAAGCAGACATG 

GACAGACGAGGGCTCAGTCTCAGAGCAAATGCTGCGGAGTGAACTACTACTCCTCGCCTGTG 

TGCACAACTATCAGCCGTGCGTACAGAGGGCAGAAGGCTATTTCAGAAAGTGGAAGGAATCC 

AATGGAAACTTGAGCCTGCCTGTCGACGTGACCTTGGCAGTGTTTGCTGTGGGGGCCCAGAG 

CACAGAAGGCTGGGATTTTCTTTATAGTAAATATCAGTTTTCTTTGTCCAGTACTGAGAAAA 

GCCAAATTGAATTTGCCCTCTGCAGAACCCAAAATAAGGAAAAGCTTCAATGGCTACTAGAT 

GAAAGCTTTAAGGGAGATAAAATAAAAACTCAGGAGTTTCCACAAATTCTTACACTCATTGG 

CAGGAACCCAGTAGGATACCCACTGGCCTGGCAATTTCTGAGGAAAAACTGGAACAAACTTG 

TACAAAAGTTTGAACTTGGCTCATCTTCCATAGCCCACATGGTAATGGGTACAACAAATCAA 

TTCTCCACAAGAACACGGCTTGAAGAGGTAAAAGGATTCTTCAGCTCTTTGAAAGAAAATGG 

TTCTCAGCTCCGTTGTGTCCAACAGACAATTGAAACCATTGAAGAAAACATCGGTTGGATGG 

ATAAGAATTTTGATAAAATCAGAGTGTGGCTGCAAAGTGAAAAGCTTGAACGTATGTAAAAA 

TTCCTCCCTTGCCCGGTTCCTGTTATCTCTAATCACCAACATTTTGTTGAGTGTATTTTCAA 

ACTAGAGATGGCTGTTTTGGCTCCAACTGGAGATACTTTTTTCCCTTCAACTCATTTTTTGA 

CTATCCCTGTGAAAAGAATAGCTGTTAGTTTTTCATGAATGGGCTTTTTCATGAATGGGCTA 

TCGCTACCATGTGTTTTGTTCATCACAGGTGTTGCCCTGCAACGTAAACCCAAGTGTTGGGT 

TCCCTGCCACAGAAGAATAAAGTACCTTATTCTTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 248 

MVFLPLKWSLATMSFLLSSLLALLTVSTPSWCQSTEAS PKRSDGTPFPWNKI RLPE YVI PVH 
YDLLIHANLTTLTFWGTTKVEITASQPTSTIILHSHHLQISRATLRKGAGERLSEEPLQVLE 
HPPQEQIALLAPEPLLVGLPYTWIHYAGNLSETFHGFYKSTYRTKEGELRILASTQFEPTA 
ARMAFPCFDEPAFKASFSIKIRREPRHLAISNMPLVKSVTVAEGLIEDHFDVTVKMSTYLVA 
F 1 1 SDFES VS KI TKSGVKVS VYAVPDKINQADYALDAAVTLLEFYEDYFS I P YPLPKQDLAA 
I PDFQSGAMENWGLTTYRESALLFDAEKSS ASS KLG I TVTVAHELAHQWFGNLVTMEWWNDL 
WLNEGFAKFMEFVSVSVTHPELKVGDYFFGKCFDAMEVDALNSSHPVSTPVENPAQIREMFD 
DVS YDKGACI LNMLREYLSADAFKSGI VQYLQKHS YKNTKNEDLWDSMAS I C PTDG VKGMDG 
FCSRSQHSSSSSHWHQEGVDVKTMI^rWTLQRGFPLITITVRGRNVHMKQEHYMKGSDGAPD 
TGYLWHVPLTF I TSKSNMVHRFLLKTKTDVL ILPEEVEWI KFNVGMNGYY I VHYEDDGWDSL 
TGLLKGTHTAVSSNDRASLINNAFQLVSIGKLSIEKALDLSLYLKHETEIMPVFQGLNELIP 
MYKLMEKRDMNEVETQFKAFLIRLLRDLIDKQTWTDEGSVSEQMLRSELLLLACVHNYQPCV 
QRAEGYFRKWKESNGNLSLPVDVTLAVFAVGAQSTEGWDFLYSKYQFSLSSTEKSQIEFALC 
RTQNKEKLQWLLDESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNWNKLVQKFELGS 

SSIAHMVMGTTNQFSTRTRLEEVKGFFSSLKENGSQLRCVQQTIETIEENIGWMDKNFDKIR 
VWLQSEKLERM 
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CAGCCACAGACGGGTCATGAGCGCGGTATTACTGCTGGCCCTCCTGGGGTTCATCCTCCCAC 

TGCCAGGAGTGCAGGCGCTGCTCTGCCAGTTTGGGACAGTTCAGCATGTGTGGAAGGTGTCC 

GACCTACCCCGGCAATGGACCCCTAAGAACACCAGCTGCGACAGCGGCTTGGGGTGCCAGGA 

CACGTTGATGCTCATTGAGAGCGGACCCCAAGTGAGCCTGGTGCTCTCCAAGGGCTGCACGG 

AGGCCAAGGACCAGGAGCCCCGCGTCACTGAGCACCGGATGGGCCCCGGCCTCTCCCTGATC 

TCCTACACCTTCGTGTGCCGCCAGGAGGACTTCTGCAACAACCTCGTTAACTCCCTCCCGCT 

TTGGGCCCCACAGCCCCCAGCAGACCCAGGATCCTTGAGGTGCCCAGTCTGCTTGTCTATGG 

AAGGCTGTCTGGAGGGGACAACAGAAGAGATCTGCCCCAAGGGGACCACACACTGTTATGAT 

GGCCTCCTCAGGCTCAGGGGAGGAGGCATCTTCTCCAATCTGAGAGTCCAGGGATGCATGCC 

CCAGCCAGGTTGCAACCTGCTCAATGGGACACAGGAAATTGGGCCCGTGGGTATGACTGAGA 

ACTGCAATAGGAAAGATTTTCTGACCTGTCATCGGGGGACCACCATTATGACACACGGAAAC 

TTGGCTCAAGAACCCACTGATTGGACCACATCGAATACCGAGATGTGCGAGGTGGGGCAGGT 

GTGTCAGGAGACGCTGCTGCTCATAGATGTAGGACTCACATCAACCCTGGTGGGGACAAAAG 

GCTGCAGCACTGTTGGGGCTCAAAATTCCCAGAAGACCACCATCCACTCAGCCCCTCCTGGG 

GTGCTTGTGGCCTCCTATACCCACTTCTGCTCCTCGGACCTGTGCAATAGTGCCAGCAGCAG 

CAGCGTTCTGCTGAACTCCCTCCCTCCTCAAGCTGCCCCTGTCCCAGGAGACCGGCAGTGTC 

CTACCTGTGTGCAGCCCCTTGGAACCTGTTCAAGTGGCTCCCCCCGAATGACCTGCCCCAGG 

GGCGCCACTCATTGTTATGATGGGTACATTCATCTCTCAGGAGGTGGGCTGTCCACCAAAAT 

GAGCATTCAGGGCTGCGTGGCCCAACCTTCCAGCTTCTTGTTGAACCACACCAGACAAATCG 

GGATCTTCTCTGCGCGTGAGAAGCGTGATGTGCAGCCTCCTGCCTCTCAGCATGAGGGAGGT 

GGGGCTGAGGGCCTGGAGTCTCTCACTTGGGGGGTGGGGCTGGCACTGGCCCCAGCGCTGTG 

GTGGGGAGTGGTTTGCCCTTCCTGCTAACTCTATTACCCCCACGATTCTTCACCGCTGCTGA 

CCACCCACACTCAACCTCCCTCTGACCTCATAACCTAATGGCCTTGGACACCAGATTCTTTC 

CCATTCTGTCCATGAATCATCTTCCCCACACACAATCATTCATATCTACTCACCTAACAGCA 

ACACTGGGGAGAGCCTGGAGCATCCGGACTTGCCCTATGGGAGAGGGGACGCTGGAGGAGTG 

GCTGCATGTATCTGATAATACAGACCCTGTCCTTTCA 
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FIGURE 250 



MSAVLLLALLGFILPLPGVQALLCQFGTVQHVWKVSDLPRQWTPKNTSCDSGLGCQDTLMLI 
ESGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLWAPQP 
PADPGSLRCPVCLSMEGCLEGTTEE I CPKGTTHCYDGLLRLRGGG I FSNLRVQGCMPQPGCN 

LLNGTQEIGPVGMTENCNRKDFLTCHRGTTIMTHGNLAQEPTDWTTSNTEMCEVGQVCQETL 
LLIDVGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCNSASSSSVLLN 
SLPPQAAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLSTKMSIQGC 
VAQPSSFLLNHTRQIGIFSAREKRDVQPPASQHEGGGAEGLESLTWGVGLALAPALWWGVVCPSC 
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FIGURE 251 

GCGACGGGCAGGACGCCCCGTTCGCCTAGCGCGTGCTCAGGAGTTGGTGTCCTGCCTGCGCT 

CAGGATGAGGGGGAATCTGGCCCTGGTGGGCGTTCTAATCAGCCTGGCCT 

CCATCTGGACATCCTCAGCCGGCTGGCGATGACGCCTGCTCTGTGCAGATCCTCGTCCCTGG 

CCTCAAAGGGGATGCGGGAGAGAAGGGAGACAAAGGCGCCCCCGGACGGCCTGGAAGAGTCG 

GCCCCACGGGAGAAAAAGGAGACATGGGGGACAAAGGACAGAAAGGCAGTGTGGGTCGTCAT 

GGAAAAATTGGTCCCATTGGCTCTAAAGGTGAGAAAGGAGATTCCGGTGACATAGGACCCCC 

TGGTCCTAATGGAGAACCAGGCCTCCCATGTGAGTGCAGCCAGCTGCGCAAGGCCATCGGGG 

AGATGGACAACCAGGTCTCTCAGCTGACCAGCGAGCTCAAGTTCATCAAGAATGCTGTCGCC 

GGTGTGCGCGAGACGGAGAGCAAGATCTACCTGCTGGTGAAGGAGGAGAAGCGCTACGCGGA 

CGCCCAGCTGTCCTGCCAGGGCCGCGGGGGCACGCTGAGCATGCCCAAGGACGAGGCTGCCA 

ATGGCCTGATGGCCGCATACCTGGCGCAAGCCGGCCTGGCCCGTGTCTTCATCGGCATCAAC 

GACCTGGAGAAGGAGGGCGCCTTCGTGTACTCTGACCACTCCCCCATGCGGACCTTCAACAA 

GTGGCGCAGCGGTGAGCCCAACAATGCCTACGACGAGGAGGACTGCGTGGAGATGGTGGCCT 

CGGGCGGCTGGAACGACGTGGCCTGCCACACCACCATGTACTTCATGTGTGAGTTTGACAAG 

GAGAACATGTGAGCCTCAGGCTGGGGCTGCCCATTGGGGGCCCCACATGTCCCTGCAGGGTT 

GGCAGGGACAGAGCCCAGACCATGGTGCCAGCCAGGGAGCTGTCCCTCTGTGAAGGGTGGAG 

GCTCACTGAGTAGAGGGCTGTTGTCTAAACTGAGAAAATGGCCTATGCTTAAGAGGAAAATG 

AAAGTGTTCCTGGGGTGCTGTCTCTGAAGAAGCAGAGTTTCATTACCTGTATTGTAGCCCCA 

ATGTCATTATGTAATTATTACCCAGAATTGCTCTTCCATAAAGCTTGTGCCTTTGTCCAAGC 

TATACAATAAAATCTTTAAGTAGTGCAGTAGTTAAGTCCAAAAAAAAAAAAAAAAAAA 
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FIGURE 252 

MRGNLALVGVLI SLAFLSLLPSGHPQPAGDDACS VQ I LVPGLKGDAGEKGDKGAPGRPGRVG 
PTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGE 
MDNQVSQLTSELKF I KNAVAGVRETE SK I YLL VKEE KR YADAQLSCQGRGGTLS MPKDEAAN 
GLMAAYLAQAGLARVF I G INDLEKEGAFVYSDHS PMRTFNKWRSGE PNNAYDEEDCVEMVAS 
GGWNDVACHTTMYFMCEFDKENM 
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FIGURE 253 

AGTGACTGCAGCCTTCCTAGATCCCCTCCACTCGGTTTCTCTCTTTGCAGGAGCACCGGCAG 

CACCAGTGTGTGAGGGGAGCAGGCAGCGGTCCTAGCCAGTTCCTTGATCCTGCCAGACCACC 

CAGCCCCCGGCACAGAGCTGCTCCACAGGCACCATGAGGATCATGCTGCTATTCACAGCCAT 

CCTGGCCTTCAGCCTAGCTCAGAGCTTTGGGGCTGTCTGTAAGGAGCCACAGGAGGAGGTGG 

TTCCTGGCGGGGGCCGCAGCAAGAGGGATCCAGATCTCTACCAGCTGCTCCAGAGACTCTTC 

AAAAGCCACTCATCTCTGGAGGGATTGCTCAAAGCCCTGAGCCAGGCTAGCACAGATCCTAA 

GGAATCAACATCTCCCGAGAAACGTGACATGCATGACTTCTTTGTGGGACTTATGGGCAAGA 

GGAGCGTCCAGCCAGAGGGAAAGACAGGACCTTTCTTACCTTCAGTGAGGGTTCCTCGGCCC 

CTTCATCCCAATCAGCTTGGATCCACAGGAAAGTCTTCCCTGGGAACAGAGGAGCAGAGACC 

TTTATAAGACTCTCCTACGGATGTGAATCAAGAGAACGTCCCCAGCTTTGGCATCCTCAAGT 

ATCCCCCGAGAGCAGAATAGGTACTCCACTTCCGGACTCCTGGACTGCATTAGGAAGACCTC 

TTTCCCTGTCCCAATCCCCAGGTGCGCACGCTCCTGTTACCCTTTCTCTTCCCTGTTCTTGT 

AACATTCTTGTGCTTTGACTCCTTCTCCATCTTTTCTACCTGACCCTGGTGTGGAAACTGCA 

TAGTGAATATCCCCAACCCCAATGGGCATTGACTGTAGAATACCCTAGAGTTCCTGTAGTGT 

CCTACATTAAAAATATAATGTCTCTCTCTATTCCTCAACAATAAAGGATTTTTGCATATGAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 254 



MRIMLLFTAILAFSLAQSFGAVCKEPQEEWPGGGRSKRDPDLYQLLQRLFKSHSSLEGLLK 
ALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPEGKTGPFLPSVRVPRPLHPNQLGSTGK 
SSLGTEEQRPL 
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FIGURE 255 

GGGCGTCTCCGGCTGCTCCTATTGAGCTGTCTGCTCGCTGTGCCCGCTGTGCCTGCTGTGCC 
CGCGCTGTCGCCGCTGCTACCGCGTCTGCTGGACGCGGGAGACGCCAGCGAGCTGGTGATTG 
GAGCCCTGCGGAGAGCTCAAGCGCCCAGCTCTGCCCCAGGAGCCCAGGCTGCCCCGTGAGTC 
CCATAGTTGCTGCAGGAGTGGAGCCATGAGCTGCGTCCTGGGTGGTGTCATCCCCTTGGGGC 
TGCTGTTCCTGGTCTGCGGATCCCAAGGCTACCTCCTGCCCAACGTCACTCTCTTAGAGGAG 
CTGCTCAGCAAATACCAGCACAACGAGTCTCACTCCCGGGTCCGCAGAGCCATCCCCAGGGA 
GGACAAGGAGGAGATCCTCATGCTGCACAACAAGCTTCGGGGCCAGGTGCAGCCTCAGGCCT 
CCAACATGGAGTACATGGTGAGCGCCGGCTCCGGCCGCAGAGGCTGGCACCGGGGGTGGGGC 
CTGGGCCACCAGCCTGCTCTGTTCCCCAGCCAGCTCTGTTCCCCAGCCAGTGCGTGTGATGG 
CTGGCTCAGGGTCTCCTCTGGCAGGGGAGGATCCCGGCTCTGTTCTGTTTTGTTTGTTTGTT 
TTGAGACAGGGTCTCACTCTGCCACTGACGCTGGAGTGCAATGGCACAATCGTCATGCCGTG 
AAACCTTAGACTCCCGGGGTTAAGCGATCCTGCTTCAGCCTCCCAAGTAGCTGGAACTACAG 
GCATGCACCATGGTGCCCAGCTAGATTTTAAATATTTTGTGGAGATGGGGGTCTTGCTACGT 
TGCCCAGGCTGGTCTTGAACTCCTAGGCTCAAGCAATCCTCCTGCCTCAGCCTCTCAAAGTG 
CTAGGATTATAGGCATGAGTCACCCTGTCTGGCTCTGGCTCTGTTCTTAACATTCTGCCAAA 
ACAACACACGTGGGTTCCCTGTGCAGAGCCTGCCTCGTTGCCTTCATGTCACTCTTGGTAGC 
TCCACTGGGAACACAGCTCTCAGCCTTTCCCACCTGGAGGCAGAGTGGGGAGGGGCCCAGGG 
CTGGGCTTTGCTGATGCTGATCTCAGCTGTGCCACACGCTAGCTGCACCACCCTGACTTCTC 
CTTAGCCCGTGTGAGCCTCACTTTCCACTTGGAGAGTCCTTCCTCGCGTGGTTGCCATGACT 
GTGAGATAAGTCGAGGCTGTGAAGGGCCCGGCACAGACTGACCTGCCTCCCCAACCCCTAGG 
CTTTGCTAACCGGGAAAGGAGCTAACGGTGACAGAAGACAGCCAAGGTCAACCCTCCCGGGT 
GATTGTGATGGGTGTTCCAGGTGTGGTTGGGCGATGCTGCTACTTGACCCCAAGCTCCAGTG 
TGGAAACTTCCTTCCTGGCTGGTTTTCCAGAACTACAGAGGAATGGACCACAGTCTTCCAGG 
GTCCCTCCTCGTCCACCAACCGGGAGCCTCCACCTTGGCCATCCGTCAGCTATGAATGGCTT 
TTTAAACAAACCCACGTCCCAGCCTGGGTAACATGGTAAAGCCCCGTCTCTACAAAAAAATC 
CAAGTTAGCCGGGCATGGTGGTGCGCACCTGTAGTCCCAGCTGCAGTGGGACTGAGGTGGAG 
GTGGAGGTGGGGGGTGGGAGCTGAGGAAGGAGGATCGCTTGAGCCTGGGAAGTCGAGGCTGC 
AGTGAGCTGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCAAAAA 
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FIGURE 256 

MSCVLGGVIPLGLLFLVCGSQGYLLPNVTLLEELLSKYQHNESHSRVRRAIPREDKEEILML 
HNKLRGQVQPQASNMEYMVSAGSGRRGWHRGWGLGHQPALFPSQLCSPASACDGWLRVSSGR 
GGSRLCSVLFVCFETGSHSATDAGVQWHNRHALKP 
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FIGURE 257 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGG 
GTCTGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGG 
GTATGACTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGC 
TTCCTGGAATTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCAC 
CCTCCAC<^TGCAAGATCTC^CACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTG 
TGTCCTTCTTGGCCCGGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTT 
CAGCAGGCCCCCACCCTCCTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 25ft 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTS 
VTLHHARS QHHWCNT 
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FIGURE 259 

AATTGTATCTGTGTAATGTTAAAACAAACGAAATAAAATAGAAGGAAAAACTTTCTGAGTTT 
CAAAAACAACAGACTAGTACTCTAAAGAACTCTTTAAAACAATTAACTGTTAGGATTGCAGT 
TATGATTGGATATTATTTAATTCTGTTTCTGATGTGGGGTTCCTCCACTGTGTTCTGTGTGC 
TATTAATATTTACCATTGCAGAAGCTTCATTCAGTGTTGAAAATGAATGCTTAGTGGATCTG 
TGCCTCTTACGCATATGTTACAAATTATCTGGAGTTCCTAATCAATGCAGAGTTCCCCTCCC 
CTCCGATTGTTCTAAATAATTGAAAGATGTCTGCTGTGGAAAAAGGCATGTATTTAAATCTG 
TATGATTCTCAACCATCTTTAGTTGGGAAAGGTCCTTGAAAGCCAATGGAAATACTTTTTTT 
TTTTCTTGGCACTAATCAAGTGAGTGTTACCTTTTCACTTAGTAGGATGTGTTGTTACGCTA 
GTAAAATAGAAACCTGTGTTTATTCTCAGGTATTTTAGAAACAACAGCCATCATTTTATTTT 
ATGTGTGTGTTCTTGGCTGTATTCATAAATTATATATTTTGGGCTATCAAATATTACTTCAT 
TCAATATAAATAACAATAGTAGAAGTTGTTTACTTAGATATGCTTTCTAGTTGCATTTTCTC 
AGCCTATGTAAGACTACTTTGTTGTAATAGCCTTTGAAATTTACAGTACTGTCTCTCTACTA 
TCTTCAGATTACTTGATTCAAATAAACCAATTATGTTTGTAATTGATATTAATAAAACCAGA 
ATAAAAGTTCATATCTACCC 



W ° 99/63088 PCT/US99/12252 



FIGURE 260 



M I G YYL I LFLMWGS STVFCVLL I FT I AEAS FS VENECL VDLCLLR I CYKLSG VPNQCR VPLP 
SDCSK 
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FIGURE 261 

GAGGATTTGCCACAGCAGCGGATAGAGCAGGAGAGCACCACCGGAGCCCTTGAGACATCCTT 

GAGAAGAGCCACAGCATAAGAGACTGCCCTGCTTGGTGTTTTGCAGGATGATGGTGGCCCTT 

CGAGGAGCTTCTGCATTGCTGGTTCTGTTCCTTGCAGCTTTTCTGCCCCCGCCGCAGTGTAC 

CCAGGACCCAGCCATGGTGCATTACATCTACCAGCGCTTTCGAGTCTTGGAGCAAGGGCTGG 

AAAAATGTACCCAAGCAACGAGGGCATACATTCAAGAATTCCAAGAGTTCTCAAAAAATATA 

TCTGTCATGCTGGGAAGATGTCAGACCTACACAAGTGAGTACAAGAGTGCAGTGGGTAACTT 

GGCACTGAGAGTTGAACGTGCCCAACGGGAGATTGACTACATACAATACCTTCGAGAGGCTG 

ACGAGTGCATCGTATCAGAGGACAAGACACTGGCAGAAATGTTGCTCCAAGAAGCTGAAGAA 

GAGAAAAAGATCCGGACTCTGCTGAATGCAAGCTGTGACAACATGCTGATGGGCATAAAGTC 

TTTGAAAATAGTGAAGAAGATGATGGACACACATGGCTCTTGGATGAAAGATGCTGTCTATA 

ACTCTCCAAAGGTGTACTTATTAATTGGATCCAGAAACAACACTGTTTGGGAATTTGCAAAC 

ATACGGGCATTCATGGAGGATAACACCAAGCCAGCTCCCCGGAAGCAAATCCTAACACTTTC 

CTGGCAGGGAACAGGCCAAGTGATCTACAAAGGTTTTCTATTTTTTCATAACCAAGCAACTT 

CTAATGAGATAATCAAATATAACCTGCAGAAGAGGACTGTGGAAGATCGAATGCTGCTCCCA 

GGAGGGGTAGGCCGAGCATTGGTTTACCAGCACTCCCCCTCAACTTACATTGACCTGGCTGT 

GGATGAGCATGGGCTCTGGGCCATCCACTCTGGGCCAGGCACCCATAGCCATTTGGTTCTCA 

CAAAGATTGAGCCGGGCACACTGGGAGTGGAGCATTCATGGGATACCCCATGCAGAAGCCAG 

GATGCTGAAGCCTCATTCCTCTTGTGTGGGGTTCTCTATGTGGTCTACAGTACTGGGGGCCA 

GGGCCCTCATCGCATCACCTGCATCTATGATCCACTGGGCACTATCAGTGAGGAGGACTTGC 

CCAACTTGTTCTTCCCCAAGAGACCAAGAAGTCACTCCATGATCCATTACAACCCCAGAGAT 

AAGCAGCTCTATGCCTGGAATGAAGGAAACCAGATCATTTACAAACTCCAGACAAAGAGAAA 

GCTGCCTCTGAAGTAATGCATTACAGCTGTGAGAAAGAGCACTGTGGCTTTGGCAGCTGTTC 

TACAGGACAGTGAGGCTATAGCCCCTTCACAATATAGTATCCCTCTAATCACACACAGGAAG 

AGTGTGTAGAAGTGGAAATACGTATGCCTCCTTTCCCAAATGTCACTGCCTTAGGTATCTTC 

CAAGAGCTTAGATGAGAGCATATCATCAGGAAAGTTTCAACAATGTCCATTACTCCCCCAAA 

CCTCCTGGCTCTCAAGGATGACCACATTCTGATACAGCCTACTTCAAGCCTTTTGTTTTACT 

GCTCCCCAGCATTTACTGTAACTCTGCCATCTTCCCTCCCACAATTAGAGTTGTATGCCAGC 

CCCTAATATTCACCACTGGCTTTTCTCTCCCCTGGCCTTTGCTGAAGCTCTTCCCTCTTTTT 

CAAATGTCTATTGATATTCTCCCATTTTCACTGCCCAACTAAAATACTATTAATATTTCTTT 

CTTTTCTTTTCTTTTTTTTGAGACAAGGTCTCACTATGTTGCCCAGGCTGGTCTCAAACTCC 

AGAGCTCAAGAGATCCTCCTGCCTCAGCCTCCTAAGTACCTGGGATTACAGGCATGTGCCAC 

CACACCTGGCTTAAAATACTATTTCTTATTGAGGTTTAACCTCTATTTCCCCTAGCCCTGTC 

CTTCCACTAAGCTTGGTAGATGTAATAATAAAGTGAAAATATTAACATTTGAATATCGCTTT 

CCAGGTGTGGAGTGTTTGCACATCATTGAATTCTCGTTTCACCTTTGTGAAACATGCACAAG 

TCTTTACAGCTGTCATTCTAGAGTTTAGGTGAGTAACACAATTACAAAGTGAAAGATACAGC 

TAGAAAATACTACAAATCCCATAGTTTTTCCATTGCCCAAGGAAGCATCAAATACGTATGTT 

TGTTCACCTACTCTTATAGTCAATGCGTTCATCGTTTCAGCCTAAAAATAATAGTCTGTCCC 

TTTAGCCAGTTTTCATGTCTGCACAAGACCTTTCAATAGGCCTTTCAAATGATAATTCCTCC 

AGAAAACCAGTCTAAGGGTGAGGACCCCAACTCTAGCCTCCTCTTGTCTTGCTGTCCTCTGT 

TTCTCTCTTTCTGCTTTAAATTCAATAAAAGTGACACTGAGCAAAAAAAAAAAAAAA 
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FIGURE 262 
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MMVALRGASALLVLFLAAFLPPPQCTQDPAMVHYIYQRFRVLEQGLEKCTQATRAYIQEFQE 
FSKN I S VMLGRCQT YTSE YKSAVGNLALRVERAQRE I D YI Q YLREADE CIVS EDKTLAEMLL 
QEAEEEKKI RTLLNAS CDNMLMG I KS LK I VKKMMDTHGSWMKDAVYNS PKVYLL I GSRNNTV 
WEFANIRAFMEDNTKPAPRKQILTLSWQGTGQVIYKGFLFFHNQATSNEIIKYNLQKRTVED 
RMLLPGGVGRALVYQHSPSTYIDLAVDEHGLWAIHSGPGTHSHLVLTKIEPGTLGVEHSWDT 

PCRSQDAEASFLLCGVLYWYSTGGQGPHRITCIYDPLGTISEEDLPNLFFPKRPRSHSMIH 
YNPRDKQLYAWNEGNQ I I YKLQTKRKLPLK 
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FIGURE 263 

GGGCGCCCGCGTACTCACTAGCTGAGGTGGCAGTGGTTCCACCAACATGGAGCTCTCGCAGA 

TGTCGGAGCTCATGGGGCTGTCGGTGTTGCTTGGGCTGCTGGCCCTGATGGCGACGGCGGCG 

GTAGCGCGGGGGTGGCTGCGCGCGGGGGAGGAGAGGAGCGGCCGGCCCGCCTGCCAAAAAGC 

AAATGGATTTCCACCTGACAAATCTTCGGGATCCAAGAAGCAGAAACAATATCAGCGGATTC 

GGAAGGAGAAGCCTCAACAACACAACTTCACCCACCGCCTCCTGGCTGCAGCTCTGAAGAGC 

CACAGCGGGAACATATCTTGCATGGACTTTAGCAGCAATGGCAAATAGCTGGCTACCTGTGC 

AGATGATCGCACCATCCGCATCTGGAGCACCAAGGACTTCCTGCAGCGAGAGCACCGCAGCA 

TGAGAGCCAACGTGGAGCTGGACCACGCCACCCTGGTGCGCTTCAGCCCTGACTGCAGAGCC 

TTCATCGTCTGGCTGGCCAACGGGGACACCCTCCGTGTCTTCAAGATGACCAAGCGGGAGGA 

TGGGGGCTACACCTTCACAGCCACCCCAGAGGACTTCCCTAAAAAGCACAAGGCGCCTGTCA 

TCGACATTGGCATTGCTAACACAGGGAAGTTTATCATGACTGCCTCCAGTGACACCACTGTC 

CTCATCTGGAGCCTGAAGGGTCAAGTGCTGTCTACCATCAACACCAACCAGATGAACAACAC 

ACACGCTGCTGTATCTCCCTGTGGCAGATTTGTAGCCTCGTGTGGCTTCACCCCAGATGTGA 

AGGTTTGGGAAGTCTGCTTTGGAAAGAAGGGGGAGTTCCAGGAGGTGGTGCGAGCCTTCGAA 

CTAAAGGGCCACTCCGCGGCTGTGCACTCGTTTGCTTTCTCCAACGACTCACGGAGGATGGC 

TTCTGTCTCCAAGGATGGTACATGGAAACTGTGGGACACAGATGTGGAATACAAGAAGAAGC 

AGGACCCCTACTTGCTGAAGACAGGCCGCTTTGAAGAGGCGGCGGGTGCCGCGCCGTGCCGC 

CTGGCCCTCTCCCCCAACGCCCAGGTCTTGGCCTTGGCCAGTGGCAGTAGTATTCATCTCTA 

CAATACCCGGCGGGGCGAGAAGGAGGAGTGCTTTGAGCGGGTCCATGGCGAGTGTATCGCCA 

ACTTGTCCTTTGACATCACTGGCCGCTTTCTGGCCTCCTGTGGGGACCGGGCGGTGCGGCTG 

TTTCACAACACTCCTGGCCACCGAGCCATGGTGGAGGAGATGCAGGGCCACCTGAAGCGGGC 

CTCCAACGAGAGCACCCGCCAGAGGCTGCAGCAGCAGCTGACCCAGGCCCAAGAGACCCTGA 

AGAGCCTGGGTGCCCTGAAGAAGTGACTCTGGGAGGGCCCGGCGCAGAGGATTGAGGAGGAG 

GGATCTGGCCTCCTCATGGCACTGCTGCCATCTTTCCTCCCAGGTGGAAGCCTTTCAGAAGG 

AGTCTCCTGGTTTTCTTACTGGTGGCCCTGCTTCTTCCCATTGAAACTACTCTTGTCTACTT 

AGGTCTCTCTCTTCTTGCTGGCTGTGACTCCTCCCTGACTAGTGGCCAAGGTGCTTTTCTTC 

CTCCCAGGCCCAGTGGGTGGAATCTGTCCCCACCTGGCACTGAGGAGAATGGTAGAGAGGAG 

AGGAGAGAGAGAGAGAATGTGATTTTTGGCCTTGTGGCAGCACATCCTCACACCCAAAGAAG 

TTTGTAAATGTTCCAGAACAACCTAGAGAACACCTGAGTACTAAGCAGCAGTTTTGCAAGGA 

TGGGAGACTGGGATAGCTTCCCATCACAGAACTGTGTTCCATCAAAAAGACACTAAGGGATT 

TCCTTCTGGGCCTCAGTTCTATTTGTAAGATGGAGAATAATCCTCTCTGTGAACTCCTTGCA 

AAGATGATATGAGGCTAAGAGAATATCAAGTCCCCAGGTCTGGAAGAAAAGTAGAAAAGAGT 

AGTACTATTGTCCAATGTCATGAAAGTGGTAAAAGTGGGAACCAGTGTGCTTTGAAACCAAA 

TTAGAAACACATTCCTTGGGAAGGCAAAGTTTTCTGGGACTTGATCATACATTTTATATGGT 

TGGGACTTCTCTCTTCGGGAGATGATATCTTGTTTAAGGAGACCTCTTTTCAGTTCATCAAG 

TTCATCAGATATTTGAGTGCCCACTCTGTGCCCAAATAAATATGAGCTGGGGATTAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 264 

MELSQMSELMGLSVLLX3LLALMATAAVARGWLRAGEERSGRPACQKANGFPPDKSSGSKKQK 
QYQRIRKEKPQQHNFTHRLLAAALKSHSGNISCMDFSSNGKYLATCADDRTIRIWSTKDFLQ 
REHRSMRANVELDHATLVRFS PDCRAF I VWLANGDTLR VFKMTKRE DGG YTFTATPEDFPKK 
HKAPVIDIGIANTGKFIMTASSDTTVLIWSLKGQVLSTINTNQMNNTHAAVSPCGRFVASCG 
FTPDVKVWEVCFGKKGEFQEVVRAFELKGHSAAVHSFAFSNDSRRMASVSKDGTWKLWDTDV 
EYKKKQDP YLLKTGRFEEAAGAAPCRLALS PNAQVLALASGSS IHLYNTRRGEKEECFERVH 
GE C I ANLS FD I TGRFLAS CGDRAVRLFHNTPGHRAMVEEMQGHLKRASNE STRQRLQQQLTQ 
AQETLKSLGALKK 
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TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTAAGCAGG 

CAGTGTTTTGCCTTCACCCCAAGTGACCATGAGAGGTGCCACGCGAGTCTCAATCATGCTCC 

TCCTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATGTCCAGTGT 

GGGGCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGCACCCCGCT 

GGGGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTCAGGAAACGCA 

AGCACCACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAGGTTCCCGGACGGCAGGTAC 

CGCTGCTCCATGGACTTGAAGAACATCAATTTTTAGGCGCTTGCCTGGTCTCAGGATACCCA 

CCATCCTTTTCCTGAGCACAGCCTGGATTTTTATTTCTGCCATGAAACCCAGCTCCCATGAC 

TCTCCCAGTCCCTACACTGACTACCCTGATCTCTCTTGTCTAGTACGCACATATGCACACAG 

GCAGACATACCTCCCATCATGACATGGTCCCCAGGCTGGCCTGAGGATGTCACAGCTTGAGG 

CTGTGGTGTGAAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTA 

AATGGCAGAAAGGACATTCCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCCTG 

CCCCTCTCCCCACATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTCTTGGGT 

GCATTGCTCAGAGTCCCAGGTCCTGGCCTGACCCTCAGGCCCTTCACGTGAGGTCTGTGAGG 

ACCAATTTGTGGGTAGTTCATCTTCCCTCGATTGGTTAACTCCTTAGTTTCAGACCACAGAC 

TCAAGATTGGCTCTTCCCAGAGGGCAGCAGACAGTCACCCCAAGGCAGGTGTAGGGAGCCCA 

GGGAGGCCAATCAGCCCCCTGAAGACTCTGGTCCCAGTCAGCCTGTGGCTTGTGGCCTGTGA 

CCTGTGACCTTCTGCCAGAATTGTCATGCCTCTGAGGCCCCCTCTTACCACACTTTACCAGT 

TAACCACTGAAGCCCCCAATTCCCACAGCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAA 

TCTAATCTGATATTGACATATTAGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCA 

AGGATCAGCCCTGAGAGCAGGTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGG 

TGGGAGCAAGGGACAGGGAGCAGGGCAGGGGCTGAAAGGGGCACTGATTCAGACCAGGGAGG 

CAACTACACACCAACATGCTGGCTTTAGAATAAAAGCACCAACTGAAAAAA 
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FIGURE 266 

MRGATRVS IMLLLVTVSDCAVITGACERDVQCGAGTCCAI SLWLRGLRMCTPLGREGEECHP 
GSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 
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FIGURE 267 

AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAGGGAGGCACTTGAGAAATCTCTTTC 

CTCCAGGACCCAAGTTTCTTCACCATGGGGATGTGGTCCATTGGTGCAGGAGCCCTGGGGGC 

TGCTGCCTTGGCATTGCTGCTTGCCAACACAGACGTGTTTCTGTCCAAGCCCCAGAAAGCGG 

CCCTGGAGTACCTGGAGGATATAGACCTGAAAACACTGGAGAAGGAACCAAGGACTTTCAAA 

GCAAAGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGTGCGGAGGCCAGGCTGTTT 

CCTCTGTCGAGApGAAGCTGCGGATCTGTCCTCCCTGAAAAGCATGTTGGACCAGCTGGGCG 

TCCCCCTCTATGCAGTGGTAAAGGAGCACATCAGGACTGAAGTGAAGGATTTCCAGCCTTAT 

TTCAAAGGAGAAATCTTCCTGGATGAAAAGAAAAAGTTCTATGGTCCACAAAGGCGGAAGAT 

GATGTTTATGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCTTCCGAGCCTGGAACGGAG 

GCTTCTCTGGAAACCTGGAAGGAGAAGGCTTCATCCTTGGGGGAGTTTTCGTGGTGGGATCA 

GGAAAGCAGGGCATTCTTCTTGAGCACCGAGAAAAAGAATTTGGAGACAAAGTAAACCTACT 

TTCTGTTCTGGAAGCTGCTAAGATGATCAAACCACAGACTTTGGCCTCAGAGAAAAAATGAT 

TGTGTGAAACTGCCCAGCTCAGGGATAACCAGGGACATTCACCTGTGTTCATGGGATGTATT 

GTTTCCACTCGTGTCCCTAAGGAGTGAGAAACCCATTTATACTCTACTCTCAGTATGGATTA 

TTAATGTATTTTAATATTCTGTTTAGGCCCACTAAGGCAAAATAGCCCCAAAACAAGACTGA 

CAAAAATCTGAAAAACTAATGAGGATTATTAAGCTAAAACCTGGGAAATAGGAGGCTTAAAA 

TTGACTGCCAGGCTGGGTGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGG 

TGAGCAAGTCACTTGAGGTCGGGAGTTCGAGACCAGCCTGAGCAACATGGCGAAACCCCGTC 

TCTACTAAAAATACAAAAATCACCCGGGTGTGGTGGCAGGCACCTGTAGTCCCAGCTACCCG 

GGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCA 

CACCACTGTATTCCAGCCTGGGTGACTGAGACTCTAACTAA 
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FIGURE 268 

MS ^QDPSFFTMGMWS IGAGALGAAALALLLANTDVFLS KPQKAALEYLED I DLKTLEKE PR 

TFKAKELTOKNGAVIMAVRRPGCFLCREEAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDF 
QP YFKGE I FLDEKKKF YGPQRRKMMFMGF I RLGVWYNF FRAWNGGFSGNLEGEGF I LGGVFV 
VGSGKQG I LLEHREKEFGDKVNLLS VLEAAKM I KPQTLAS EKK 
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FIGURE 269 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCG 
GGCCAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGA 
AGCCCCTTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTG 
CTTCTGGCGCTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACA 
GACCACTTCTGCT^AATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATG 
GCAACCTGCGTCCGGAAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTG 
CTCCTGGCTGTGGGGCTGGCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGG 
CACCTACCGGCCCAGTAGCGAGGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGG 
ACTCCAAGGAGACGGTGCAGGGCTGCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCC 
CTTCATTGCTGTGTGACCTTGGGGAAAGGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAG 
TGCTTAATAGCAGGGAAGAAGGTACTTCAAAGACTCTGCCCCTGAGGTCAAGAGAGGATGGG 
GCTATTCACTTTTATATATTTATATAAAATTAGTAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 270 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIV 
VFSLLAALLIAVGLALLWKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 
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FIGURE 271 

AATATATCATCTATTTATCATTAATCAATAATGTATTCTTTTATTCCAATAACATTTGGGTT 

TTGGGATTTTAATTTTCAAACACAGCAGAATGACATTTTTTCTGTCACTATTATTATTGTTG 

GTATGTGAAGCTATTTGGAGATCCAATTCAGGAAGCAACACATTGGAGAATGGCTACTTTCT 

ATCAAGAAATAAAGAGAACCACAGTCAACCCACACAATCATCTTTAGAAGACAGTGTGACTC 

CTACCAAAGCTGTCAAAACCACAGGCAAGGGCATAGTTAAAGGACGGAATCTTGACTCAAGA 

GGGTTAATTCTTGGTGCTGAAGCCTGGGGCAGGGGTGTAAAGAAAAACACTTAGATTCAATG 

ATTGTAAATTTAAGGCAAATACACATATTAGTATTACCTTAGTGTAATGTATCCCTGTCATA 

TATACAATAAGGTGAAATTATAAGTACCCTATGCAGTTGGCTGGACAGTTCTAAATTGGACT 

TTATTAATTTTTAAAATCAGTAACTGATTTATCACTGGCTATGTGCTTAGATCTACAGGAGA 

TCATATAATTTGATACAAATAAAAGAAAAGTGTTCTCTCCCCTTACAGAATTGACATTTTAA 

ATGCGATACAGTTAGAATAGGAAATATGACATTAGAAAGGAAGAATGACAGGGAGAAAGGAA 
AGAAGGGAAAATGTTGCCAAGGAAAAAAAAA 
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FIGURE 272 

MTFFLSLLLLLVCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTTGK 
GIVKGRNLDSRGLILGAEAWGRGVKKNT 
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GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTPT 
GTGCCTGCTGCACCAGTCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATA 
TTGTCATTGTTATAGATCCTAGTGTGCCAGAAGATGAAAAAATAATTGAA^^^TAGA^GGA^T 
ATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGATTTTOTTCAAAAA 

AAAACCATAAACATGCTGATGTTATAGTTGCACCACCTACACTCCCAGGTAGAGATGAACCA 
TACACCAAGCAGTTCACAGAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCT 
TCTACTTGGAAAAAAACAAAATGAATATGGACCACCAGGCAAACTGTTTGTCCATGAGTGGG 
CTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTCTACCGTGCTAAG 
TCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAA 
GTGTCAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACTGTATG 
GAAAA 3 TTG ^ CAA ^ CTTTCCT ^^ 

CAAAGTATTGATTCTGTTGTTGAATTTTGTAACGAAAAAACCCATAATCAAGAAGCTCCAAG 
CCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAATTCTGAGGATT 
TTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATC 

A ?IS^^ TTGTGTGCTTAGTTCTTGATAAGTCTGG ^ 

AAATCGAATGAATCAAGCAGCAAAACATTTCCTGCTGCAGACTGTTGAAAATCGATCCTGGG 
TGGGGATGGTTCACTTTGATAGTACTGCC^CTATTGT^^ 

AGTGATGAAAGAAACACACTCATGGCAGGATTACCTACATATCCTCTCGGAGGAACTT^T 
CTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACAT^ 

CCGAAGTACTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGA 

AAACAAAGTGGGGCCATTGTTCATTTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAA 

AGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTTTCAGATGAAGCT^ 

GCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCT^CCAGAAGTCCC 

CAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAA^ 

TGATAGTACAGTGGGAAAGGACACGTTCTTTCTCATCACATGGAA^AGTCTGCCTCCCAOT 
TTTCTCTCTGGGATCCCAGTGGAACAATAATGGAAAATTTCACAGl^GATGC^CT 
ATGGCCTATCTCAGTATTCGAGGAACTGCAAAGGTGGGCACTTGGGCATACAATCTTCAAGC 
CAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCA^ 

CTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATT 
GTTTACGCAGAAATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGAcS^ 
TGAATCACAGAATGGACATACAGAAGTTTTGGAACTTTTGGATAATGGTGCAGGCGCTGATT 
CTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATA^CAGAA^ 

AGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTC^ 
GAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGA^ 

GACCTGAAATTGATGAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGA 

GGTGCATTTGTGGTATCACAAGTCCCAAGCCTTCCCTTGCCTGACCAATACCCACCAA 

AATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTTACATGGACAG 

GAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGC^^ 

GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAAT 

(^ccaactccaaggaaagctttgcatttaaaccagaaaatatctcagaagaaS 

ACATATTTATTGCCATTAAAAGTATAGATAAAAGCAATTTGACATCAAAAGTAT 
GC^CAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACA^^^ 

tactcctactcctactcctgataaaagtc^taattctggagtt^ 

TGTCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATTTGAACCTTA 

ACGAAGAAAAAAATCTTCAAGTAGACCTAGAAGAGAGTTTIAAAAAACA^ 

AAAGGATATTTCTGAATCTTAAAATTCATCCCATGTGTGATCATAAACTCATAAAAAT 

TTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCAI^GATATGTAAAAAOT 
CAAGATTAAAATTTAATAGTTTCATTTATTTC^^ 

AAAGATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGAT^^ 
GATATTTC^TTGCATCAAGAAATTAAAA^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^^^^^^^^^^ 
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FIGURE 274 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKI IEQIEDMVTTASTY 

LFEATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTEC 

GEKGEYIHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATR 

CSAGISGRNRVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWE 

FCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLV 

LDKSGSMGGKDRI^RMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLM 

AGLPTYPLGGTS ICSGI KYAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVH 

FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLT 

LNSNAWMNDTVIIDSTVGKDTFFLITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPG 

TAKVGTWAYNLQAKANPETLT ITVTSRAANSS VPP I TVNAKMNKDVNS FPSPMI VYAE I LQG 

YVPVLGANVTAFIESQNGHTEVLELLDNGAGADSFKNDGVYSRYFTAYTENGRYSLKVRAHG 
GANTARLKLRPPLNRAAYIPGWWNGEIEANPPRPEIDEDTQTTLEDFSRTASGGAFWSQV 
PSLPLPDQYPPSQI TDLDATVHEDKI I LTWTAPGDNFDVGKVQR YI IR I SAS I LDLRDS FDD 

ALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAIKSIDKSNLTSKVSNIAQVTLFIP 
QANPDD IDPTPTPTPTPTPDKSHNSGVNI STLVLS VI GS WI VNF I LSTTI 
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FIGURE 225 

CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGG 

GCAGGGGTGACAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGAC 

GCCAAGGTAATTTTGACCCAGAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCC 

AGTTATGCGAGGATTTACTAGAGAGTGTCAACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGG 

JT^^^^ CCTTTGTC ^ GGCTGC GTTCTCTTAGCAGTGCTCAGAAGTGACTTGCCTGAGGGTG 

GACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAAGGCTGTGATGGGAATGAA 

GGTGAAAACTTGGAGATTTCACTTC^GTCATTGCTTCTGCCTGCAAGATCATCCTTTAAAAGTAGAGA 
^^"GTGTGGTGGTTAACTCC^^ 

TCCGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCT 

TTGAGGGATGCCACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCT 

TGCGTGGATTTCCCGGGTGGTGGTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGT 

TGGCCTGCACCCCAAAAGGTGACGAGGAGCAGCTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAG 

GAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCACCGCAACTACGTGAGCAGCCTGAAGCG 

GCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGGAATGGGCAGTACCAAG 

CCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTCCTGGCC 

TTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGC 

AGTGCCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCG 

AGGAGAAGCCTGTGAGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACC 

CTGAACAATCCTGCAGAGAACAGCCCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGAT 

CTACCGAACAGAAAGGGACAAAGGGACATTGTATGAGCTCACCTTCAAAGGGGACCACAAACACGAAT 

TCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAAGTGAAAAATGAAAAGCTCAACATG 

GCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTCCGGCAGTTCATCCA 

GAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGGAAAG 

AAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAAGCTGCCAACTTCAGGAACTTTACC 

TTCATCCAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGG 

AAGCAACGTCCTTCTCTTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTA 

GGCTGAATACACAGCCAGGGAAGAAGGTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATA 

^^ CGGCCACCATGATGCAGTG CCTCCCTTGGAACAGCAGCTGGTCATAAAGAAGGAAA 

TTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGACTTCATCAATATAGGTGGGTTTG 

ATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTCCACAGCAACCTC 

ATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAGCT 

GACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGG 

GCATGCTGGTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAACAGAAACAGAAGACAAGTAGCAAA 

AAAACATGAACTCCCAGAGAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTG 

GCTGCAACAGAGAAAAGACTTCCATAAAGGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAA 

AGGG ^ CCGATTOCTCTCTGTTGGGG TTTTTACAACAGAAATCAAAATCTCCGCTTTGCC 

AACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTGAGATTATAAGCCTAATGGTG 

TGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGAAATATTCATG 

ATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGA 

GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTA 
^ G ^T ACTAGTGAGTACATTAAGTA ^ TAAA ATGGACCAGAAA^ 

TGTCATATTTTCCCCAAGATTAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCT 

CCGTTTTTTTCTTTTATTTAAAAATGCACTTTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTA 

CCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTACATTTTTATATTTTTTAAGAAGATACTTT 

GAGA ^ C 3JI ATGAGAACmCAG ^ CAAAGCATCAA ATTGATGCCATATCCA^ 

CTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACTAATACAGA 

CGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATG 

A ? A ^^ GOTTACAGAAAAGGAAACTCATTCA ^^ 

G J G ^ CCAAACAATCTCTmCAA ^^ 
?^ GAAAAATATATATATATA ™^^ 

GAAG IEJT TGCTACATGTTATCCAC CCCAGGCCAGGTGGAAGTAACTGAATTATTTTTTAAATTAAGC 

AG ^ C T ACTCAATCACCAAGATGC ^ CTGAAAATTGCA TTTTATTACCATTTCAAACTACT 

^ A ^I ACAGTTAACATAGAGTGGTCTCTTCATTCATGTGAAAAT ^ 

GAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAA 
GAA ^J CAAGCTGTTGGTGTGTTAAAAAATGCA ^ GTATTGA TT^ 

AATTAAAACACAGGCCATGAATGGAAGGTGGTATTGCACAGCTAATAAAATATGATTTGTGGATATGAA 
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FIGURE 276 

MMMVRRGLLAW I SRWVLLVLLCCAI S VL YMLACT PKGDEEQLALPRANS PTGKEG YQAVLQ 
EWEEQHRNYVSSLKRQ I AQLKEELQERSEQLRNGQYQASDAAGLGLDRS PPEKTQADLLAFL 
HSQVDKAEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIES 
ALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPI 
MKVKNEKLNMANTLIWIVPIAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVK 
GILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCR 
LNTQPGKKVFYPVLFSQYNPGI I YGHHDAVPPLEQQLVI KKETGFWRDFGFGMTCQYRSDFI 

NIGGFDLDIKGWGGEDVHLYRKYLHSNLIWRTPVRGLFHLWHEKRCMDELTPEQYKMCMQS 
KAMNEASHGQLGMLVFRHE IEAHLRKQKQKTSSKKT 
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FIGURE 277 

GAAAGAATGTTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTCATGCTGAACTCTGTCAACC 

AGGTGCAGAAAATGCTTTTAAAGTGAGACTTAGTATCAGAACAGCTCTGGGAGATAAAGCAT 

ATGCCTGGGATACCAATGAAGAATACCTCTTCAAAGCGATGGTAGCTTTCTCCATGAGAAAA 

GTTCCCAACAGAGAAGCAACAGAAATTTCCCATGTCCTACTTTGCAATGTAACCCAGAGGGT 

ATCATTCTGGTTTGTGGTTACAGACCCTTCAAAAAATCACACCCTTCCTGCTGTTGAGGTGC 

AATCAGCCATAAGAATGAACAAGAACCGGATCAACAATGCCTTCTTTCTAAATGACCAAACT 

CTGGAATTTTTAAAAATCCCTTCCACACTTGCACCACCCATGGACCCATCTGTGCCCATCTG 

GATTATTATATTTGGTGTGATATTTTGCATCATCATAGTTGCAATTGCACTACTGATTTTAT 

CAGGGATCTGGCAACGTAGAAGAAAGAACAAAGAACCATCTGAAGTGGATGACGCTGAAGAT 

AAGTGTGAAAACATGATCACAATTGAAAATGGCATCCCCTCTGATCCCCTGGACATGAAGGG 

GGGCATATTAATGATGCCTTCATGACAGAGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGT 

TGTTCTGCTTCCTCAAGAAATTAAACATTTGTTTCTGTGTGACTGCTGAGCATCCTGAAATA 

CCAAGAGCAGATCATATATTTTGTTTCACCATTCTTCTTTTGTAATAAATTTTGAATGTGCT 

TGAAAGTGAAAAGCAATCAATTATACCCACCAACACCACTGAAATCATAAGCTATTCACGAC 

TCAAAATATTCTAAAATATTTTTCTGACAGTATAGTGTATAAATGTGGTCATGTGGTATTTG 

TAGTTATTGATTTAAGCATTTTTAGAAATAAGATCAGGCATATGTATATATTTTCACACTTC 

AAAGACCTAAGGAAAAATAAATTTTCCAGTGGAGAATACATATAATATGGTGTAGAAATCAT 

TGAAAATGGATCCTTTTTGACGATCACTTATATCACTCTGTATATGACTAAGTAAACAAAAG 

TGAGAAGTAATTATTGTAAATGGATGGATAAAAATGGAATTACTCATATACAGGGTGGAATT 

TTATCCTGTTATCACACCAACAGTTGATTATATATTTTCTGAATATCAGCCCCTAATAGGAC 

AATTCTATTTGTTGACCATTTCTACAATTTGTAAAAGTCCAATCTGTGCTAACTTAATAAAG 

TAATAATCATCTCTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 278 

MLWLLFFLVTAIHAELCQPGAENAFKVRLSIRTALGDKAYAWDTNEEYLFKAMVAFSMRKVP 
NRFATEISHVLLCNVTQRVSFWFVWDPSKNHTLPAVEVQSAIRMNKNRINNAFFLNDQTLE 
FLKIPSTLAPPMDPSVPIWI I IFGVIFCI I IVAIALLILSGIWQRRRKNKEPSEVDDAEDKC 
ENMITIENGI PSDPLDMKGGILMMPS 
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FIGURE 279 

AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCAGG 

GTGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTCATGT 

ATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCTCTTTGG 

CCTATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCTGTTAATGGGACAGATGC 

TCGGTTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACAGTGACCTGGA 

ATTTTCGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCACATAGATCCCTTC 

CAACCCATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAATCCTGAGCGGTACGA 

TGCCTCCATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACATACACCTGCCAGGTGA 

AGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTCAGCGTCGTGCACACTGTA 

CGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCTGTGCACTGATGATCAT 

AATAGTAATTGTAGTGGTCCTCTTCCAGCATTACCGGAAAAAGCGATGGGCCGAAAGAGCTC 

ATAAAGTGGTGGAGATAAAATCAAAAGAAGAGGAAAGGCTCAACCAAGAGAAAAAGGTCTCT 

GTTTATTTAGAAGACACAGACTAACAATTTTAGATGGAAGCTGAGATGATTTCCAAGAACAA 

GAACCCTAGTATTTCTTGAAGTTAATGGAAACTTTTCTTTGGCTTTTCCAGTTGTGACCCGT 

TTTCCAACCAGTTCTGCAGCATATTAGATTCTAGACAAGCAACACCCCTCTGGAGCCAGCAC 

AGTGCTCCTCCATATCACCAGTCATACACAGCCTCATTATTAAGGTCTTATTTAATTTCAGA 

GTGTAAATTTTTTCAAGTGCTCATTAGGTTTTATAAACAAGAAGCTACATTTTTGCCCTTAA 

GACACTACTTACAGTGTTATGACTTGTATACACATATATTGGTATCAAAGGGGATAAAAGCC 

AATTTGTCTGTTACATTTCCTTTCACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTA 

ATGTGTTTACTCTCTTTCCTTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGGTG 

TTTCTGATTAACAGTAAATCCTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTC 

TCCTTAACTATGAGACACATCTTGTTTTACTGAATTTCTTTCAATATTCCAGGTGATAGATT 
TTTGTCG 
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FIGURE 280 

MYGKSSTRAVLLLLGIQLTALWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALTVT 

WNFRPLDGGPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGTYTCQ 

VKNPPDVDGVIGEIRLSWHTVRFSEIHFLALAIGSACALMIIIVIVWLFQHYRKKRWAER 
AHKWE I KS KEEERLNQEKKVS VYLEDTD 
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FIGURE 281 

GCATTTTTGTCTGTGCTCCCTGATCTTCAGGTCACCACCATGAAGTTCTTAGCAGTCCTGGT 

ACTCTTGGGAGTTTCCATCTTTCTGGTCTCTGCCCAGAATCCGACAACAGCTGCTCCAGCTG 

ACACGTATCCAGCTACTGGTCCTGCTGATGATGAAGCCCCTGATGCTGAAACCACTGCTGCT 

GCAACCACTGCGACCACTGCTGCTCCTACCACTGCAACCACCGCTGCTTCTACCACTGCTCG 

TAAAGACATTCCAGTTTTACCCAAATGGGTTGGGGATCTCCCGAATGGTAGAGTGTGTCCCT 

GAGATGGAATCAGCTTGAGTCTTCTGCAATTGGTCACAACTATTCATGCTTCCTGTGATTTC 

ATCCAACTACTTACCTTGCCTACGATATCCCCTTTATCTCTAATCAGTTTATTTTCTTTCAA 
ATAAAAAATAACTATGAGCAACATAAAAAAAAAAAAA 
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FIGURE 282 

MKFLAVLVLLGVS I FLVSAQNPTTAAPADTYPATGPADDEAPDAETTAAATTATTAAPTTAT 
TAASTTARKDIPVLPKWVGDLPNGRVCP 
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FIGURE 283 

GGACTCTGAAGGTCCCAAGCAGCTGCTGAGGCCCCCAAGGAAGTGGTTCCAACCTTGGACCC 

CTAGGGGTCTGGATTTGCTGGTTAACAAGATAACCTGAGGGCAGGACCCCATAGGGGAATGC 

TACCTCCTGCCCTTCCACCTGCCCTGGTGTTCACGGTGGCCTGGTCCCTCCTTGCCGAGAGA 

GTGTCCTGGGTCAGGGACGCAGAGGACGCTCACAGACTCCAGCCCTTTGTTACCGAGAGGAC 

ACTTGGCAAGGTCCAGCGATGGTCCGGAGTCCACACACAGACTGGCGGCAGGGCAGGAGGGG 

GACAGTTCTGTTGTGCTTGGTTGGACAGTAAGAGGGTCTTGGCGAGTCCAGGGTGGGGGGCG 

GCAAACTCCATAAAGAACCAGAGGGTCTGGGCCCCGGCCACAGAGTCATCTGCCCAGCTCCT 

CTGCTGCTGGCCAGTGGGAGTGGCACGAGGTGGGGCTTTGTGCCAGTAAAACCACAGGCTGG 

ATTTGCCTGCGGGCCATGGTCCCTGTCTAGGGCAGCAATTCTCAACCTTCTTGCTCTCAGGA 

CCCCAAAGAGCTTTCATTGTATCTATTGATTTTTACCACATTAGCAATTAAAACTGAGAAAT 

GGGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGAT 

CACCTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTACTAAAAA 

TACAAAAAATTAGCCAGGCACAGTGGTGTGCACTGGTAGTCCCAGTTACTCGGGAGGCTGAG 

GCAGGAAAATCGCTTGAACCCAGGAGGCGGACGTTGCGGTGAGCCGAGATCGCGCCGCTGAT 

TCCAGCCTGGGCGACAAGAGTGAGACTCCATCTCACACA 
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FIGURE 2X4 

MLPPALPPALVFTVAWSLLAERVSWVRDAEDAHRLQPFVTERTLGKVQRWSGVHTQTGGRAG 
GGQFCCAWLDSKRVLASPGWGAANSIKNQRVWAPATESSAQLLCCWPVGVARGGALCQ 
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FIGURE 285 

GTCATGCCAGTGCCTGCTCTGTGCCTGCTCTGGGCCCTGGCAATGGTGACCCGGCCTGCCTCA 

GCGGCCCCCATGGGCGGCCCAGAACTGGCACAGCATGAGGAGCTGACCCTGCTCTTCCATGG 

GACCCTGCAGCTGGGCCAGGCCCTCAACGGTGTGTACAGGACCACGGAGGGACGGCTGACAA 

AGGCCAGGAACAGCCTGGGTCTCTATGGCCGCACAATAGAACTCCTGGGGCAGGAGGTCAGC 

CGGGGCCGGGATGCAGCCCAGGAACTTGGGGCAAGCCTGTTGGAGACTCAGATGGAGGAGGA 

TATTCTGCAGCTGCAGGCAGAGGCCACAGCTGAGGTGCTGGGGGAGGTGGCCCAGGCACAGA 

AGGTGCTACGGGACAGCGTGCAGCGGCTAGAAGTCCAGCTGAGGAGCGCCTGGCTGGGCCCT 

GCCTACCGAGAATTTGAGGTCTTAAAGGCTCACGCTGACAAGCAGAGCCACATCCTATGGGC 

CCTCACAGGCCACGTGCAGCGGCAGAGGCGGGAGATGGTGGCACAGCAGCATCGGCTGCGAC 

AGATCCAGGAGAGACTCCACACAGCGGCGCTCCCAGCCTGAATCTGCCTGGATGGAACTGAG 

GACCAATCATGCTGCAAGGAACACTTCCACGCCCCGTGAGGCCCCTGTGCAGGGAGGAGCTG 

CCTGTTCACTGGGATCAGCCAGGGCGCCGGGCCCCACTTCTGAGCACAGAGCAGAGACAGAC 

GCAGGCGGGGACAAAGGCAGAGGATGTAGCCCCATTGGGGAGGGGTGGAGGAAGGACATGTA 

CCCTTTCATGCCTACACACCCCTCATTAAAGCAGAGTCGTGGCATTTCAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 286 

MPVPALCLLWALAMVTRPASAAPMGGPELAQHEELTLLFHGTLQLGQALNGVYRTTEGRLTK 
ARNSLGLYGRTIELLGQEVSRGRDAAQELRASLLETQMEEDILQLQAEATAEVLGEVAQAQK 
VLRDS VQRLE VQLRSAWLGP AYRE FE VLKAHADKQS H I LWALTGHVQRQRREMVAQQHRLRQ 
I QERLHTAALPA 



WO 99/63088 / 

-^^J /^X O PCT/US99/12252 

FIGURE 287 

GGCAACATGGCTCAGCAGGCTTGCCCCAGAGCCATGGCAAAGAATGGACTTGTAATTTGCAT 

CCTGGTGATCACCTTACTCCTGGACCAGACCACCAGCCACACATCCAGATTAAAAGCCAGGA 

AGCACAGCAAACGTCGAGTGAGAGACAAGGATGGAGATCTGAAGACTCAAATTGAAAAGCTC 

TGGACAGAAGTCAATGCCTTGAAGGAAATTCAAGCCCTGCAGACAGTCTGTCTCCGAGGCAC 

TAAAGTTCACAAGAAATGCTACCTTGCTTCAGAAGGTTTGAAGCATTTCCATGAGGCCAATG 

AAGACTGCATTTCCAAAGGAGGAATCCTGGTTATCCCCAGGAACTCCGACGAAATCAACGCC 

CTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGACTTTTGGCTGGGCATCAATGA 

CATGGTCACGGAAGGCAAGTTTGTTGACGTCAACGGAATCGCTATCTCCTTCCTCAACTGGG 

ACCGTGCACAGCCTAACGGTGGCAAGCGAGAAAACTGTGTCCTGTTCTCCCAATCAGCTCAG 

GGCAAGTGGAGTGATGAGGCCTGTCGCAGCAGCAAGAGATACATATGCGAGTTCACCATCCC 

TAAATAGGTCTTTCTCCAATGTGTCCTCCAAGCAAGATTCATCATAACTTATAGGTTCATGA 

TCTCTAAGATCAAGTAAAAATCATAATTTTTACTTATTAAAAAATTGCAACACAAGATCAAT 

GTCCATAGCAATATGATAGCATCAGCCAATTTTGCTAACACATTTCTTTGGGATTTTGCCCT 

TCCTGGGGTATAGGGGATCAGAAATATTGATCCATGTGCACGCAGATAAAATGGCTTCTGCT 

AAACAGACTAAAATCTTTCTCTCTAGTCTTTCTCACTTGTACAAACCCAGTTTGTTTTCAAA 

AAATCACAGTAGCAATGCAACTCATCACTCTAGAAAAGCAAGCTTAGGCTACCTGAAAGATT 

TTCCCTTGGAAGTTTAGCGTATGTTTGACTAACAAAAATTCCCTACATCAGAGACTCTAGGT 

GCTATATAATCCAAAAACTTTTCAGCCTGTTGCTCATTCTGTCCCATGCTGGCAATAATACC 

TTGTCAGCCCATTACCCTTATTTTGAATTGCTCCATCTCCTGGTGGGACTTGTATCTTGTCT 

GCCATATCAGAACACAAACCCCTGAAGAGGTTCTGATTTGATTTTTTTTTTTTCTTCATGCC 

TACCCTTTTTTTGGAAGTTTCCAGCCGCAATTTGAAATGAAATGACAAGGTGTATATTTGAT 

CAATTTTCATTCCCACCATTGCATTACAACCTCTAACTTAAATGGGTAACCCTAAGGCATAT 

CAAAGAAGCAGATTGCATGATAAACGGAAATAGAAAAAAAGAACCTACATTTATTTTGCTTT 

AGCATCCTTACTCTCACCTTTTATGAGATTGAGAGTGGACTTACATTTCCTTTTTTACATTT 

TCGTATATTTATTTTTTTTAGCCATCATTATATGTTTAAGTCTATTATGGGCAACCAATCTT 

TGGAAGCTGAAAACTGAATTTAAAGAATGCTATCTTGGAAAATTGCATACGTCTGTGCAATT 

TTTTATTCTGCCTAGTGCTATTCTGCTTGTTTAACTAGATTGTACAAAATAACTTCATTGCT 

TAATATCAAATTACAAAGTTTAGACTTGGAGGGAAATGGGCTTTTTAGAAGCAAACAATTTT 

AAATATATTTTGTTCTTCAAATAAATAGTGTTTAAACATTGAATGTGTTTTGTGAACAATAT 

CCCACTTTGCAAACTTTAACTACACATGCTTGGAATTAAGTTTTAGCTGTTTTCATTGCTCA 

ATAATAAAGCCTGAATTCTGATC^^TAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 288 

MAQQACPRAMAKNGLVICILVITLLLDQTTSHTSRLKARKHSKRRVRDKDGDLKTQIEKLWT 

EVNALKEIQALQTVCLRGTKVHKKCYLASEGLKHFHEANEDCISKGGILVIPRNSDEINALQ 

DYGKRSLPGVNDFWLGINDMVTEGKFVDVNGIAISFLNWDRAQPNGGKRENCVLFSQSAQGK 
WSDEACRSSKRYI CEFT I PK 
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FIGURE 289 



GCGAGGACCGGGTATAAGAAGCCTCGTGGCCTTGCCCGGGCAGCCGCAGGTTCCCCGCGCGC 

CCCGAGCCCCCGCGCCATGAAGCTCGCCGCCCTCCTGGGGCTCTGCGTGGCCCTGTCCTGCA 

GCTCCGCTGCTGCTTTCTTAGTGGGCTCGGCCAAGCCTGTGGCCCAGCCTGTCGCTGCGCTG 

GAGTCGGCGGCGGAGGCCGGGGCCGGGACCCTGGCCAACCCCCTCGGCACCCTCAACCCGCT 

GAAGCTCCTGCTGAGCAGCCTGGGCATCCCCGTGAACCACCTCATAGAGGGCTCCCAGAAGT 

GTGTGGCTGAGCTGGGTCCCCAGGCCGTGGGGGCCGTGAAGGCCCTGAAGGCCCTGCTGGGG 

GCCCTGACAGTGTTTGGCTGAGCCGAGACTGGAGCATCTACACCTGAGGACAAGACGCTGCC 

CACCCGCGAGGGCTGAAAACCCCGCCGCGGGGAGGACCGTCCATCCCCTTCCCCCGGCCCCT 

CTCAATAAACGTGGTTAAGAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAA 
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FIGURE 290 

MKLAALLGLCVALSCSSAAAFLVGSAKPVAQPVAALESAAEAGAGTLANPLGTLNPLKLLLS 
SLGI PVNHL I EGSQKCVAELGPQAVGAVKALKALLGALTVFG 
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FIGURE 291 



TGAAGGACTTTTCCAGGACCCAAGGCCACACACTGGAAGTCTTGCAGCTGAAGGGAGGCACT 

CCTTGGCCTCCGCAGCCGATCACATGAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTG 

GCACAGGTGTGGCTGGTACCCGGCTTGGCCCCCAGTCCTCAGTCGCCAGAGACCCCAGCCCC 

TCAGAACCAGACCAGCAGGGTAGTGCAGGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGG 

CCAGCGAGGAGAAGGCCGGTGAGGAAGAGAAAGCCTGGCTGATGGCCAGCAGGCAGCAGCTT 

GCCAAGGAGACTTCAAACTTCGGATTCAGCCTGCTGCGAAAGATCTCCATGAGGCACGATGG 

CAACATGGTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATGCTGGGGGCCA 

CAGGGCCGACTGAAACCCAGATCAAGAGAGGGCTCCACTTGCAGGCCCTGAAGCCCACCAAG 

CCCGGGCTCCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACT 

GGGCCTCTCACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAGACTTTCT 

TCAATTTATCCAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTCGCAATGCCTCA 

CAGGCCAAAAGGCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAAAATTCCCAAACT 

GTTTGATGAGATTAATCCTGAAACCAAATTAATTCTTGTGGATTACATCTTGTTCAAAGGGA 

AATGGTTGACCCCATTTGACCCTGTCTTCACCGAAGTCGACACTTTCCACCTGGACAAGTAC 

AAGACCATTAAGGTGCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACCTTTGACAAGAA 

TTTTCGTTGTCATGTCCTCAAACTGCCCTACCAAGGAAATGCCACCATGCTGGTGGTCCTCA 

TGGAGAAAATGGGTGACCACCTCGCCCTTGAAGACTACCTGACCACAGACTTGGTGGAGACA 

TGGCTCAGAAACATGAAAACCAGAAACATGGAAGTTTTCTTTCCGAAGTTCAAGCTAGATCA 

GAAGTATGAGATGCATGAGCTGCTTAGGCAGATGGGAATCAGAAGAATCTTCTCACCCTTTG 

CTGACCTTAGTGAACTCTCAGCTACTGGAAGAAATCTCCAAGTATCCAGGGTTTTACGAAGA 

ACAGTGATTGAAGTTGATGAAAGGGGCACTGAGGCAGTGGCAGGAATCTTGTCAGAAATTAC 

TGCTTATTCCATGCCTCCTGTCATCAAAGTGGACCGGCCATTTCATTTCATGATCTATGAAG 

AAACCTCTGGAATGCTTCTGTTTCTGGGCAGGGTGGTGAATCCGACTCTCCTATAATTCAGG 

ACATGCATAAGCACTTCGTGCTGTAGTAGATGCTGAATCTGAGGTATCAAACACACACAGGA 

TACCAGCAATGGATGGCAGGGGAGAGTGTTCCTTTTGTTCTTAACTAGTTTAGGGTGTTCTC 

AAATAAATACAGTAGTCCCCACTTATCTGAGGGGGATACATTCAAAGACCCCCAGCAGATGC 

CTGAAACGGTGGACAGTGCTGAACCTTATATATATTTTTTCCTACACATACATACCTATGAT 

AAAGTTTAATTTATAAATTAGGCACAGTAAGAGATTAACAATAATAACAACATTAAGTAAAA 

TGAGTTACTTGAACGCAAGCACTGGAATACCATAACAGTCAAACTGATTATAGAGAAGGCTA 

CTAAGTGACTCATGGGCGAGGAGCATAGACAGTGTGGAGACATTGGGCAAGGGGAGAATTCA 

CATCCTGGGTGGGACAGAGCAGGACGATGCAAGATTCCATCCCACTACTCAGAATGGCATGC 

TGCTTAAGACTTTTAGATTGTTTATTTCTGGAATTTTTCATTTAATGTTTTTGGACCATGGT 

TGACCATGGTTAACTGAGACTGCAGAAAGCAAAACCATGGATAAGGGAGGACTACTACAAAA 

GCATTAAATTGATACATATTTTTTAAAAAAAAAAAAAAAAAAA 
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FIGURE 292 

MKWPSLLLSVLLAQVWLVPGLAPSPQSPETPAPQNQTSRWQAPREEEEDEQEASEEKAGE 
EEKAWLMASRQQLAKETSNFGFSLLRKISMRHDGNMVF^ 

KRGLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFNLSKRYF 

DTECVPMNFRNASQAKRLMNHYINKETRGKIPKLFDEINPETKLILVDYILFKGKWLTPFDP 

VFTEVDTFHLDKYKTIKVPM^GAGKFASTFDKNFRCHVLKLPYQGNATMLVVLMEKMGDHL 

ALEDYLTTDLVETWLRNMKTRNMEVFFPKFKLDQKYEMHELLRQMGIRRIFSPFADLSELSA 

TGRNLQVSRVLRRTVIEVDERGTEAVAGILSEITAYSMPPVIKVDRPFHFMIYEETSGMLLF 
LGRWNPTLL 
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FIGURE 293 

CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCTCTACAGAGACGCGGACCCCAGACATGAG 

GAGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCAGTCCCAG 

CACCCAAGGTCCCTATCAAGATGCAAGTCAAACACTGGCCCTCAGAGCAGGACCCAGAGAAG 

GCCTGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTGGTGCTGTTCCC 

TGTCCAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAGGGCAGGGGCCCCA 

TCCTTCCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGCCGTGTCCTGAGTCCC 

GAGCCCGACCATGACAGCCTGTACCACCCTCCGCCTGAGGAGGACCAGGGCGAGGAGAGGCC 

CCGGTTGTGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGAGGAAGACCAAGACCACA 

TCTACCACCCCCAGTAGGGCTCCAGGGGCCATCACTGCCCCCGCCCTGTCCCAAGGCCCAGG 

CTGTTGGGACTGGGACCCTCCCTACCCTGCCCCAGCTAGACAAATAAACCCCAGCAGGCAAA 
AAAAAAAAAAAAAAAA 
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FIGURE 294 

MRRLLLVTS LVWLLWEAGAVPAPKVP I KMQVKHWP S EQD PE KAWGARWEPPEKDDQLWL 
FPVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQGEE 
RPRLWVMPNHQVLLGPEEDQDH I YHPQ 
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AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTG 

TACCCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGC 

TGTTTCTCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAA 

TGGACCTGTTCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCC 

TAGTGCATTTGATGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCT 

GTGACATGACCTCTGGGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATG 

CGTGGGAAGTGCACGGTGGGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCC 

AGAGGGGGACGGCAACTGGGCCAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCG 

ATGACTACAAGAACCCTGGCTACTACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTG 

CCCAATAAGTCCCCCATGCAGCACTGGAGAAACAGCTCCCTGCTGAGGTACCGCACGGACAC 

TGGCTTCCTCCAGACACTGGGACATAATCTGTTTGGCATCTACCAGAAATATCCAGTGAAAT 

ATGGAGAAGGAAAGTGTTGGACTGACAACGGCCCGGTGATCCCTGTGGTCTATGATTTTGGC 

GACGCCCAGAAAACAGCATCTTATTACTCACCCTATGGCCAGCGGGAATTCACTGCGGGATT 

TGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAACGCCTTGTGTGCTGGAATGAGGG 

TCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAGGATACTTTCCAGAGGCCAGT 

CCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATATGGAACTCATGTTGGTTA 

CAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCGTTGAGAGTTTTGTG 

GGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAACAACTTACCCA 

GTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGAAAAAAA 
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FIGURE 296 

MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTEN 
GVIYQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFG 
SAEAATSDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGI 
YQKYPVKYGEGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAAN 
ALCAGMRVTGCNTEHHCIGGGGYFPEAS PQQCGDFSGFDWSGYGTHVGYSSSRE I TEAAVLLFYR 
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FIGURE 297 

GCGGAGCCGGCGCCGGCTGCGCAGAGGAGCCGCTCTCGCCGCCGCCACCTCGGCTGGGAGCC 

CACGAGGCTGCCGCATCCTGCCCTCGGAACAATGGGACTCGGCGCGCGAGGTGCTTGGGCCG 

CGCTGCTCCTGGGGACGCTGCAGGTGCTAGCGCTGCTGGGGGCCGCCCATGAAAGCGCAGCC 

ATGGCGGCATCTGCAAACATAGAGAATTCTGGGCTTCCACACAACTCCAGTGCTAACTCAAC 

AGAGACTCTCCAACATGTGCCTTCTGACCATACAAATGAAACTTCCAACAGTACTGTGAAAC 

CACCAACTTCAGTTGCCTCAGACTCCAGTAATACAACGGTCACCACCATGAAACCTACAGCG 

GCATCTAA.TACAACAACACCAGGGATGGTCTCAACAAATATGACTTCTACCACCTTAAAGTC 

TACACCCAAAACAACAAGTGTTTCACAGAACACATCTCAGATATCAACATCCACAATGACCG 

TAACCCACAATAGTTCAGTGACATCTGCTGCTTCATCAGTAACAATCACAACAACTATGCAT 

TCTGAAGCAAAGAAAGGATCAAAATTTGATACTGGGAGCTTTGTTGGTGGTATTGTATTAAC 

GCTGGGAGTTTTATCTATTCTTTACATTGGATGCAAAATGTATTACTCAAGAAGAGGCATTC 

GGTATCGAACCATAGATGAACATGATGCCATCATTTAAGGAAATCCATGGACCAAGGATGGA 

ATACAGATTGATGCTGCCCTATCAATTAATTTTGGTTTATTAATAGTTTAAAACAATATTCT 

CTTTTTGAAAATAGTATAAACAGGCCATGCATATAATGTACAGTGTATTACGTAAATATGTA 

AAGATTCTTCAAGGTAACAAGGGTTTGGGTTTTGAAATAAACATCTGGATCTTATAGACCGT 

TCATACAATGGTTTTAGCAAGTTCATAGTAAGACAAACAAGTCCTATCTTTTTTTTTTGGCT 

GGGGTGGGGGCATTGGTCACATATGACCAGTAATTGAAAGACGTCATCACTGAAAGACAGAA 

TGCCATCTGGGCATACAAATAAGAAGTTTGTCACAGCACTCAGGATTTTGGGTATCTTTTGT 

AGCTCACATAAAGAACTTCAGTGCTTTTCAGAGCTGGATATATCTTAATTACTAATGCCACA 

CAGAAATTATACAATCAAACTAGATCTGAAGCATAATTTAAGAAAAACATCAACATTTTTTG 
TGCTTTAAACTGTAGTAGTTGGTCTAGAAACAAAATACTCC 
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FIGURE 29S 

MGLGARGAWAALLLGTLQVLALLGAAHESAAMAASANIENSGLPHNSSANSTETLQHVPSDH 
TNETSNSTVKPPTSVASDSSNTTVTTMKPTAASNTTTPGMVSTNMTSTTLKSTPKTTSVSQN 
TSQ I S TSTMTVTHNSSVTSAAS S VTI TTTMHS EAKKGSKFDTGS FVGG I VLTLG VLS I L Y IG 
CKMYYSRRGIRYRTIDEHDAI I 
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FIGURE 299 

CAGCCGGGTCCCAAGCCTGTGCCTGAGCCTGAGCCTGAGCCTGAGCCCGAGCCGGGAGCCGG 

TCGCGGGGGCTCCGGGCTGTGGGACCGCTGGGCCCCCAGCGATGGCGACCCTGTGGGGAGGC 

CTTCTTCGGCTTGGCTCCTTGCTCAGCCTGTCGTGCCTGGCGCTTTCCGTGCTGCTGCTGGC 

GCAGCTGTCAGACGCCGCCAAGAATTTCGAGGATGTCAGATGTAAATGTATCTGCCCTCCCT 

ATAAAGAAAATTCTGGGCATATTTATAATAAGAACATATCTCAGAAAGATTGTGATTGCCTT 

CATGTTGTGGAGCCCATGCCTGTGCGGGGGCCTGATGTAGAAGCATACTGTCTACGCTGTGA 

ATGCAAATATGAAGAAAGAAGCTCTGTCACAATCAAGGTTACCATTATAATTTATCTCTCCA 

TTTTGGGCCTTCTACTTCTGTACATGGTATATCTTACTCTGGTTGAGCCCATACTGAAGAGG 

CGCCTCTTTGGACATGCACAGTTGATACAGAGTGATGATGATATTGGGGATCACCAGCCTTT 

TGCAAATGCACACGATGTGCTAGCCCGCTCCCGCAGTCGAGCCAACGTGCTGAACAAGGTAG 

AATATGCACAGCAGCGCTGGAAGCTTCAAGTCCAAGAGCAGCGAAAGTCTGTCTTTGACCGG 

CATGTTGTCCTCAGCTAATTGGGAATTGAATTCAAGGTGACTAGAAAGAAACAGGCAGACAA 

CTGGAAAGAACTGACTGGGTTTTGCTGGGTTTCATTTTAATACCTTGTTGATTTCACCAACT 

GTTGCTGGAAGATTCAAAACTGGAAGCAAAAACTTGCTTGATTTTTTTTTCTTGTTAACGTA 

ATAATAGAGACATTTTTAAAAGCACACAGCTCAAAGTCAGCCAATAAGTCTTTTCCTATTTG 

TGACTTTTACTAATAAAAATAAATCTGCCTGTAAATTATCTTGAAGTCCTTTACCTGGAACA 

AGCACTCTCTTTTTCACCACATAGTTTTAACTTGACTTTCAAGATAATTTTCAGGGTTTTTG 

TTGTTGTTGTTTTTTGTTTGTTTGTTTTGGTGGGAGAGGGGAGGGATGCCTGGGAAGTGGTT 

AACAACTTTTTTCAAGTCACTTTACTAAACAAACTTTTGTAAATAGACCTTACCTTCTATTT 

TCGAGTTTCATTTATATTTTGCAGTGTAGCCAGCCTCATCAAAGAGCTGACTTACTCATTTG 

ACTTTTGCACTGACTGTATTATCTGGGTATCTGCTGTGTCTGCACTTCATGGTAAACGGGAT 

CTAAAATGCCTGGTGGCTTTTCACAAAAAGCAGATTTTCTTCATGTACTGTGATGTCTGATG 

CAATGCATCCTAGAACAAACTGGCCATTTGCTAGTTTACTCTAAAGACTAAACATAGTCTTG 

GTGTGTGTGGTCTTACTCATCTTCTAGTACCTTTAAGGACAAATCCTAAGGACTTGGACACT 

TGCAATAAAGAAATTTTATTTTAAACCCAAGCCTCCCTGGATTGATAATATATACACATTTG 

TCAGCATTTCCGGTCGTGGTGAGAGGCAGCTGTTTGAGCTCCAATATGTGCAGCTTTGAACT 

AGGGCTGGGGTTGTGGGTGCCTCTTCTGAAAGGTCTAACCATTATTGGATAACTGGCTTTTT 

TCTTCCTATGTCCTCTTTGGAATGTAACAATAAAAATAATTTTTGAAACATCAA 
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FIGURE 300 

MATLWGGLLRLGSLLSLSCLALS VLLLAQLSDAAKNFEDVRCKC I CPP YKENSGH I YNKNIS 
QIO)CDCLHVVEPMPVRGPDVEAYCLRCECKYEERSSVTIKVTI 1 1 YLS I LGLLLLYMVYLTL 
VEP I LKRRLFGHAQL I QSDDD IGDHQ PFANAHDVLARSRSRANVLNKVE YAQQRWKLQVQEQ 
RKSVFDRHWLS 
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FIGURE 301 

GCACCTGCGACCACCGTGAGCAGTCATGGCGTACTCCACAGTGCAGAGAGTCGCTCTGGCTT 
CTGGGCTTGTCCTGGCTCTGTCGCTGCTGCTGCCCAAGGCCTTCCTGTCCCGCGGGAAGCGG 
CAGGAGCCGCCGCCGACACCTGAAGGAAAATTGGGCCGATTTCCACCTATGATGCATCATCA 
CCAGGCACCCTCAGATGGCCAGACTCCTGGGGCTCGTTTCCAGAGGTCTCACCTTGCCGAGG 
CATTTGCAAAGGCCAAAGGATCAGGTGGAGGTGCTGGAGGAGGAGGTAGTGGAAGAGGTCTG 
ATGGGGCAGATTATTCCAATCTACGGTTTTGGGATTTTTTTATATATACTGTACATTCTATT 
TAAGGTAAGTAGAATCATCCTAATCATATTACATCAATGAAAATCTAATATGGCGATAAAAA 
TCATTGTCTACATTAAAACTTCTTATAGTTCATAAAATTATTTCAAATCCATCATCTCTTTA 
AATCCTGCCTCCTCTTCATGAGGTACTTAGGATAGCCATTATTTCAGTTTCACATAAGAATG 
TTTACTCAATGTTTAAGTGTTTTGCCCCAAAATTCACAACTAACAAGGCAGAACTAGGACTT 
GAACATGGATCTTTTGGTTCTTAATCCAGTGAGTGATACAATTCAATGCACTCCCCTGCCA 
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FIGURE 302 

MAYSTVQRVALiASGLVLALSLLLPKAFLSRGKRQE P PPTPEGKLGRF P PMMHHHQAPS DGQT 
PGARFQRSHLAEAFAKAKGSGGGAGGGGSGRGLMGQ 1 1 PI YGFG I FLYILYI LFKVSRI I LI 
ILHQ 
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FIGURE 303 

CGGCTCGAGTGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATCTT 

GGATTTGAAAGTTGAGAGCAGCATGTTTTGCCCACTGAAACTCATCCTGCTGCCAGTGTTAC 

TGGATTATTCCTTGGGCCTGAATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTCCATGTG 

GGTGATTCAGCTCTGATGGGATGTGTTTTCCAGAGCACAGAAGACAAATGTATATTCAAGAT 

AGACTGGACTCTGTCACCAGGAGAGCACGCCAAGGACGAATATGTGCTATACTATTACTCCA 

ATCTCAGTGTGCCTATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATGC 

AATGATGGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTGTGA 

AATCCGCCTCAAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTACTGCATGTGCTTCCAG 

AGGAGCCCAAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAG 

AGCACAGAAGTGAAACACGTGACCAAGGTAGAATGGATATTTTCAGGACGGCGCGCAAAGGA 

GGAGATTGTATTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACTCCCAGAGCTGGG 

GCCACTTCCAGAATCGTGTGAACCTGGTGGGGGACATTTTCCGCAATGACGGTTCCATCATG 

CTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCACCTAGGGAACCT 

GGTGTTCAAGAAAACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTGACCC 

CGGCAGCCCTGAGGCCTCTGGTCTTGGGTGGTAATCAGTTGGTGATCATTGTGGGAATTGTC 

TGTGCCACAATCCTGCTGCTCCCTGTTCTGATATTGATCGTGAAGAAGACCTGTGGAAATAA 

GAGTTCAGTGAATTCTACAGTCTTGGTGAAGAACACGAAGAAGACTAATCCAGAGATAAAAG 

AAAAACCCTGCCATTTTGAAAGATGTGAAGGGGAGAAACACATTTACTCCCCAATAATTGTA 

CGGGAGGTGATCGAGGAAGAAGAACCAAGTGAAAAATCAGAGGCCACCTACATGACCATGCA 

CCCAGTTTGGCCTTCTCTGAGGTCAGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGG 

GAATGCCAAAAACACAGCAAGCCTTTTGAGAAGAATGGAGAGTCCCTTCATCTCAGCAGCGG 

TGGAGACTCTCTCCTGTGTGTGTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTC 

CCAGCTGTCCTCCTGTCTCATTGTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGG 

CAGAGAGACTGGACAGCTCTGGAGGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTGGC 

CTCTGGAGTGGGACACTGGCCCTGGGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTT 

GGATCAGACCCTCCTGTGGGCAGGGTTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATA 
AAAACCAACCCAAATCAA 
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FIGURE 304 

MFCPLKLILLPVLLDYSLGLNDLNVSPPELTVHVGDSALMGCWQSTEDKCIFKIDWTLSPG 
EHAKDE YVLYYYSNLS VP IGRFQNRVHLMGD I LCNDGS LLLQDVQEADQGTYI CE IRLKGES 

QVFKKAVVLHVLPEEPKEI^VHVGGLIQMGCWQSTE^KHVTKVEWIFSGRRAKEE 
HKLRJMSVEYSQSWGHFQNRVNLVGD I FRNDGS I MLQGVRESDGGNYTCS IHLGNLVFKKTI V 

LHVSPEEPRTLVTPAALRPLVLGGNQLVIIVGIVCATILLLPVLILIVKKTCGNKSSVNSTV 

LVKNTKKTNPEIKEKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEATYMTMHPVWPSLR 
SDRNNSLEKKSGGGMPKTQQAF 
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FIGURE 305 

CTATGAAGAAGCTTCCTGGAAAACAATAAGCAAAGGAAAACAAATGTGTCCCATCTCACATG 

GTTCTACCCTACTAAAGACAGGAAGATCATAAACTGACAGATACTGAAATTGTAAGAGTTGG 

AAACTACATTTTGCAAAGTCATTGAACTCTGAGCTCAGTTGCAGTACTCGGGAAGCCATGCA 

GGATGAAGATGGATACATCACCTTAAATATTAAAACTCGGAAACCAGCTCTCGTCTCCGTTG 

GCCCTGCATCCTCCTCCTGGTGGCGTGTGATGGCTTTGATTCTGCTGATCCTGTGCGTGGGG 

ATGGTTGTCGGGCTGGTGGCTCTGGGGATTTGGTCTGTCATGCAGCGCAATTACCTACAAGA 

TGAGAATGAAAATCGCACAGGAACTCTGCAACAATTAGCAAAGCGCTTCTGTCAATATGTGG 

TAAAACAATCAGAACTAAAGGGCACTTTCAAAGGTCATAAATGCAGCCCCTGTGACACAAAC 

TGGAGATATTATGGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAGAGAG 

TAAGCAGTACTGCACTGACATGAATGCTACTCTCCTGAAGATTGACAACCGGAACATTGTGG 

AGTACATCAAAGCCAGGACTCATTTAATTCGTTGGGTCGGATTATCTCGCCAGAAGTCGAAT 

GAGGTCTGGAAGTGGGAGGATGGCTCGGTTATCTCAGAAAATATGTTTGAGTTTTTGGAAGA 

TGGAAAAGGAAATATGAATTGTGCTTATTTTCATAATGGGAAAATGCACCCTACCTTCTGTG 

AGAACAAACATTATTTAATGTGTGAGAGGAAGGCTGGCATGACCAAGGTGGACCAACTACCT 

TAATGCAAAGAGGTGGACAGGATAACACAGATAAGGGCTTTATTGTACAATAAAAGATATGT 
ATGAATGCATCAGTAGCTGAAAAAAAAAAAAAA 
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FIGURE 306 



MQDEDG YI TLN I KTRKPALVS VGPASSS WWRVMAL I LL I LCVGMWGL VALG I WSVMQRNYL 
QDENENRTGTLQQLAKRFCQYWKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWE 
ESKQYCTDMNATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISENMFEFL 
EDGKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP 
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(54) Title: MEMBRANE-BOUND PROTEINS AND NUCLEIC ACIDS ENCODING THE SAME 

(57) Abstract: The present invention is directed to membrane-bound polypeptides and to nucleic acid molecules encoding those 
polypeptides. Also provided herein are vectors and host cells comprising those nucleic acid sequences, chimeric polypeptide 
molecules comprising the polypeptides of the present invention fused to heterologous polypeptide sequences, antibodies which bind 
to the polypeptides of the present invention and to methods for producing the polypeptides of the present invention. 
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invention 3 is limited to SEQ.IDs. 7 and 8, invention 4 is 

limited to SEQ.IDs. 13 and 14 and invention 135 is 

limited to SEQ.IDs. 423 and 424. 
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